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[ffifc^gt] [PROBLEM TO BE SOLVED] 

ttrollM^WtSiS It has the activity which controls the 

U ia?iJ#-^ 1 morphogenesis of a plant, 

5 /^Sc^iJXfi-WT 5 / SSIE it transforms a plant by substitution of one or 

Wz.te\^XWfo<nj&Wfc'&%:'$\ two or more amino acid residue which does not 

5St£f-I£W£-§-;i&^l affect the activity which controls the 

Xf4 2^,±»T 5 J W%^<r>W morphogenesis of a plant in the amino acid 

t&s ^^c&SVMilf A£r^Tl~-5 sequence shown in sequence number 1 or this 

7 5 /Stlc^J^ir^ W amino acid sequence, the gene which codes 

K-f -SiStfSiK protein including the amino acid sequence 

r cDjt{5^-{i*f-f"6T>'^"fe^ which has deletion or insertion, or DNA which 

^RNA £ , 3§?|,'f"-5D N A T*W expresses the antisense RNA to this gene. 




T-DNA Jf Agflfft: T-DNA inserting portion 
'fyfny: intron 
^ y^: Exon 



9/25/2006 



4/74 Copyright (C) 2006 The Thomson Corporation. 



JP9-56382-A 



THOMSON 

*- 



imm*<Dmm] [claims] 

ummi) [claim 1] 

U ! tiO<nMWiM}& : kffl'0-tZ>%i DNA which has the activity which controls the 

tt^U I2^'J#^1 \z.^rT morphogenesis of a plant, and codes protein 

5 J ^Sc^iJXii-^T 5: y &iB including the amino acid sequence shown in 
W^^XWh^WtSkWM'kU sequence number 1 or the amino acid 
Wt~ ^fB^^BW^^-^.^^ 1 sequence and has the substitution, deletion, or 
Xfi 2 £X±.<DT X J WB&<OW. insertion of one or two or more amino acid 

&ifc*>5 VM4J?A&W1"5 residue which does not affect the activity which 

T$ J SfciB^JSr^tf* St controls the morphogenesis of a plant in this 

— Ki" ; 5DNA 0 amino acid sequence. 

'[!*** 2] [CLAIM 2] 

ft^illfeftcoDNAW^^ DNA which codes the antisense RNA which 

?rW]^J"f 5 7 V*f"t. y^RNA controls the expression of DNA of Claim 1 . 

6 = — KtoDNA. 

[lit** 3] [CLAIM 3] 

iE?!l#^- 1 i2<fctf>*&£Sfi?iJ<£> DNA of Claim 2 which has a substantially 

/>£< fc— gfB^fCftjfctUS complementary base sequence in at least one 

ftjft&SSil^J&W I"*!** 3 ® 2 part of the base sequence of sequence number 

letfe^DNAo 1. 

[»#JI4] [CLAIM 4] 

SftjftJg 1 SBflK 0> D N A T'^K The plant body transformed by DNA of Claim 1 . 

[Sft#S5] [CLAIM 5] 

tftjfcJS 2 4 fcti 3 fetfeWD N The plant body transformed by DNA of Claim 2 

A X*l$m&$k £ thtcmVoW-o or 3. 

[»**6] [CLAIM 6] 

f ft jfcJg 1 IHfc <Z) D N A<D^§?t It is DNA which controls the expression of DNA 

£rfrH$1""<5DN Atfcot, IB of Claim 1, comprised such that DNA which has 
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yiJIMI- 2 £>t£g#-^ 1 ~ 1 7 5 at least one part of the sequence expressed 
2T*^£tL5iE? l J<£> / >&< 1 1> with the base number 1-1752 of sequence 
5DNA 0 number 2. 



[DETAILED DESCRIPTION OF THE 
INVENTION] 



[0 0 0 1] 

rti k <D D N A X*J& $ 



[0001] 

[TECHNICAL FIELD OF THE INVENTION] 

This invention relates to the plant body 
transformed by gene DNA which controls 
ecology formation of a plant, DNA which codes 
the antisense RNA with respect to the gene, 
and these DNA, it offers the technique of 
adjusting the length and the inflorescence form 
of a stalk of a plant. 



[0 0 0 2] 



[0002] 



[PRIOR ART] 

It is thought that the plant form is influenced by 
the genetic factor and an environmental factor. 
There is a method of changing the form of the 
useful race by crossing with the plant which 
differs in those form hereditarily as one of the 
procedure of formerly producing a plant with the 
short length of a stalk, and the plant from which 
the inflorescence form varied. 
However, it is difficult to obtain the solid which 
excelled the parent race in quality by being 
stabilized. 



[0 0 0 3] [0003] 

a «3*«^«Pr Moreover, it is although variation takes place to 
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m^M^X. V the gene which affects the form by the natural 

*.3Jt-{5-?-K:2Eft;& J iS£ <9, mutation or mutagenesis processing and the 

(Djte^F-ifilg^fSTI - ^- k£. solid which shows a form variation when the 

X^XBW^it^^i'Bi^-^M gene function falls can also be selected, it is 

fai-&Zbh~*Ifl£X'foZ>&, % very difficult to produce as desired the solid 

ffla^^K^^L/c**, # which variation generated only in the gene 

fe<DffiW&$.*M'M-fZ> : &fc I f- which controls a specific morphogenesis, with 

1£\1t\z.$t$k1fift£. LtzBfc&fc the useful inherited character conserved. 

[0 0 0 4] [0004] 

ZZX^fDftMBf&ZMffllr Then, it is if the gene which controls the 
Silted £¥HI1" -5^ morphogenesis of a plant can be isolated, by 

tilt, tikfeBtttt-? Z>T ^7- transforming a plant by the vector (antisense 

ty^RNA £3§ 5 =t 0 tc RNA expression vector) integrated so that the 

£fl^iAA//'c v <^ (T^^ir antisense RNA with respect to this gene might 

y^RNA^ig^??—) XM be expressed, the length of a stalk is 

%A%W&W8k't'& Itl^J:^ considered to become possible to produce the 

loS £ < ftofcftM^&fE plant which became short, 
ft -f -5 r ^ Riim- ft 5 £ % *i 

[0 0 0 5] [0005] 

iyn^^^-X^- (7 7k"K7 t v / Arabidopsis thaliana (Arabidopsis thaliana 

7s ■ f !) 7t (Arabidopsis (Arabidopsis thaliana)), the elongation between 

thaliana)) fi> S^ffi (vegetaive the nodes (internode) which follow the transition 

stage) i^flil (reproductive to a regeneration stage (reproductive stage) 

stage) ^<Dfe&lcl£%-(DW%:% from a nutrition term (vegetaive stage) with an 

{^i^ ifcD^j&S.U^tU-^ elongation of a floral bud at formation and it of 

fiFa] (internode) (DW&\t 1 &< the bud is strongly in cooperation, the branch 

jlglLT&'Q % Jft$Bt£MSflF<5tt method (highly ordered branching pattern) 

ktifc^lfrlfl ^ (highly ordered which was able to be strictly set in order is 

branching pattern) &r^i~ 0 i/ shown. 

as( 5?fXt©WWfti3^ Landsberg erecta (Ransburg erector) strain 

-f 7° (ecotype) i LT^P^tL-C known as standard ecotype (ecotype) of an 
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^ -5 Landsberg erecta Arabidopsis thaliana conserves endogenous 

/<— >f . 30/?$) Wit, rtH erector (erecta: dwarfing) variation, different 

ftii/^ (erecta : t>\> Mb) floral-bud structure is shown. 

LTJo^ Mft It forms in a top part the compact floral bud in 

mnfeZ^-fo WR\zftifiQS&. which a flower grows thick, 

fjay/^ h t£fc%- %Mf&1r Variation is pleiotropy-like (pleiotropic). 

5 o f # ffi i§ S 6tl It has a circular leaf and the short flat ellipsoid 

(pleiotropic) Tfc 9 > RJf^wH (silique) (summary announced above at Huang, 

TOft^ (silique) £W Let al. (1991), and the University of California at 

-f 5 (&±, Huang, I. et al., Los Angeles keystone symposium). 
(1991) UC L A^ — * r- — V 

v- y *° V a T- % a £ ft fc 5 
ii)o 

[0 0 0 6] [0006] 

Landsberg erecta ftwf Moreover, the dwarfing variant by which having 

MirlWI— S^-f-M^^^^Wi - the variation of a Landsberg erecta strain and 

5: ^itfc^^J^ftlkft"^ the variation of the same locus is known 

5MMb2£i&#;0 s SX#£ft"C*5 genetically is acquired, it is named er-101 

•J, er-101 ttc, er-102 strain, er-1 02 strain, and er-1 03 strain. 
er-1 03 W t $ ft X V ^ . 

[0 0 0 7] [0007] 

[»W*S#«fctJ:5i-f5« [PROBLEM TO BE SOLVED BY THE 

|g] INVENTION] 

±3iELfc J; 5 fit^GDibWL' As above-mentioned, the concern between 

^^coif-S^f- <t <7DP#fi, dwarfing of a plant and a specific gene is known 

5iSgiWl-^^tiTV^ somewhat experientially. 

a> , * CDitfii 1 £ ftfi^fcljtft However, the gene itself is not yet acquired. 

$ft"Cl*ftV\ 

[0 0 0 8] [0008] 

#$SWtiU ±E<Rjfe*»?>*;Sft This invention was made from the 

OTfc 19 , ffi^5«0^tl?fM above-mentioned viewpoint. 

£frJ$P^5Sfc^£iiW1-5 - It makes into a subject to offer the gene which 
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bZWkMt-fZo £fc, controls the morphogenesis of a plant. 

'f-XmZi&fcT^ttlrZTy?- Moreover, it transforms a plant by DNA which 

ty^RNA5-3-Kt'5DN codes this gene or the antisense RNA with 

A XWfo**foW&& t % respect to this gene, by promoting or controlling 

i$*ffl'0i-Z>jAfcT<D%%l$:iS: the expression of the gene which controls a 

MhZ^^ttfflM-tZ^ ilcio morphogenesis, the length of the plant which 

X, ^Wft £ < ft o fcffii#J the length of the stalk got long, or the stalk also 

XllI(Dg^>I< ftofcfit makes it the subject to offer the plant which 

h&^^trtFFMMtD&ikL became short, or the plant from which the 

fcffi^^tl^l"^ r t hMMt inflorescence form varied. 

[0 0 0 9] [0009] 

mM&M&l-ZtzVXD^m] [MEANS TO SOLVE THE PROBLEM] 

±faglft£M;1~ ; 5fc&K:, ^ In order to attain the above-mentioned 

3§f£l# h It *s d 4 Zi'X't<D$k objective, the present inventors clones the gene 

MDNA*^iti»^lM which controls the morphogenesis of a plant 
^rfjiJWi" Sitfe^-^ ? p— from the chromosomal DNA of an Arabidopsis 

^m.in^^B.^AytcT thaliana, after obtaining the antisense 

>"f-±y^^^^<^ ? — expression vector incorporating this gene, it 

^ i^^^-DNAtif^ finds out that the form of a plant solid 

%kLtzW.yoW(VMWift^iti- transformed by this vector DNA varies, and 

5 r t £ Hi ^tc U came to perfect this invention. 

[0010] [0010] 

-f ftfr^SP^ti, W.W)<DM1& That is, this invention has the activity which 

^j#£f&J$Pl-<5?£tt£W U lc controls the morphogenesis of a plant, it is DNA 

£|J#^ 1 (c^-f T ? J ^Sfi^iJX which codes protein including the amino acid 

itZCOT 5 /MMi^zio^^xU sequence which has the substitution, deletion, 

^<DMW t MJ&%ffl / M~f <5St4{- or insertion of the amino acid residue more than 

1X112 Sk±.(D 1 or 2 which does not affect the activity which 

T 5; / l£Slg60®&> {K&hZi controls the morphogenesis of a plant in the 

VM4Jf ASrWI"" 57-; yMa^J amino acid sequence shown in sequence 

£^tf^ W^W^— Ki"5 number 1 , or this amino acid sequence. 
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[0011] [0011] 

*fc>W&l!Efi, ±fSWDNA(D Moreover, this invention offers DNA which 

$£$l&$\iM-fZ>Tl/ : 3"£yz>R codes the antisense RNA which controls the 

NA£=i— Kt5DNA$:li expression of above-mentioned DNA. 

-f-So rcoDNAi: LTii, IB As this DNA, DNA which has a substantially 

^#77 1 fattGDt£l=IS£iJ<7}^& complementary base sequence is mentioned to 

< t h— WcHWft^fgliftjft at least one part of the base sequence of 

ij&SIfi?iJ£W-f 3 DN A^W sequence number 1. 



[0012] [0012] 

*3gf^{i£ bfc, mtZfy^? This invention offers the plant body further 

K£r=i— Kt^DNAtiffe transformed by DNA which codes said protein, 

^£tifcfit$5#^ RXfmtZTy and the plant body transformed by DNA which 

ft^^RN ASr^ — K1~ SD codes said antisense RNA. 

n a x^mmm z mzmw-^ 



[0013] [0013] 

coDNAtcj; — K $ The protein coded by DNA of this invention has 

ft 5* if©fft; many expression levels in a stalk with a 

ri^Li/^ TSftifTOlgJUl remarkable variation of a character, a flower, 

< > fc®<»l&ffl&WL<oWto. etc., and is protein in connection with control of 

^^©tofifcHfrS*^'** the morphogenesis of a plant, especially an 

M"C£>>9, :©^y/^fi5:3 elongation of a stalk. 

— K^SifrlE^f-'CW'&Sr^ftte It transforms a plant with the gene which codes 

&U Mfc^-W^S^M this protein, it is anticipated by making the 

HSritiot, expression level of this gene increase that it can 

~>k%MX*% k1£W&£fo promote an elongation of a stalk. 

5o 



[0014] [0014] 
— ft, iE/yfty^RNA It is anticipated by on the other hand 
£ a — H"f5 D N A, -ft£t>h transforming a plant by DNA which codes said 
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m^<^S«W$y»M1"5 antisense RNA, i.e., DNA which controls the 

Min^fD^^^Mir SDN A expression of the gene about control of the 

-C*t#>£^W$E&1-£ r t \z. J: morphogenesis of a plant, that it can control an 

^ MU&;$M !! fo(V^<nW9:% elongation of the stalk of a transforming plant. 

Ml" -5 r t 5 r. t tm 

[0015] [0015] 

ft, ^^(-*5V^T % r^fefls; in addition, in this specification, a 

DNAj RTf tQ-QW-Mfel-} "chromosomal DNA" and a "chromosome gene" 

tt, WWM&(Dm%k&m^$. say DNA contained in the discharge-printing 

ti5DNAWrODNA±i: color object of a plant cell, and the gene which 

HFtfE-rSite^fcv^,, exists on this DNA. 

#3PJ!U:«fc Moreover, the gene about control of the 

ftmMf&vUM^I'&Ate* morphogenesis of the plant offered by this 

SrJillJilifJ^fcli^ invention may be called a "form regulatory 

3BUi0>iH5-7\l i ^ 5 r. £ gene" or "gene of this invention." 

[0 0 16] [0016] 

l%m<D&1HtoteU} [EMBODIMENT OF THE INVENTION] 

UT. ifW^mm^m^U Hereafter, it demonstrates Embodiment of this 

9H"3o KWR&M.fc+tes M invention. 

*.fi» ^BM^fliiJ^f-MI"^ The gene of this invention is acquirable from the 

§£M£W"f SW^ftfr^ ^ plant body which has the variation about control 

^WW^t-Hfr^^fifc of a morphogenesis by isolating the variation 

^^r^fifi-S^ir^J; <?Bx#1~ gene in connection with the phenotype 

3 r i: j&s-C# -5 c $ {-> variation, for example. 

ftfc^MS'lK^-W&gifi^Jt-S Furthermore, hybridization which uses the 

•^TfNRLfc^y =*'3*^ \s1r oligonucleotide produced based on the base 

K5r7°n— yti" 7" y sequence of the obtained variation gene as a 

^Vif— ->a >\ fc<5vMi3£M probe, or a wild-type gene is acquirable from 

it-fS^ro&SE^Jl-K^Tfl 2 the chromosomal DNA of a wild-type plant with 

fiU:lJt©t!i^^^tf PCR (Polymerase Chain Reaction) which 

K £ 7° 7 "f ^ — itSPCR makes a primer one pair of oligonucleotide 
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(tK y t 7— if • y ' V produced based on the base sequence of a 

Tfi/sV) \z. J: <9 % variation gene. 

[0017] [0017] 
B1BMf&<DM@NZ-mir It demonstrates in detail production of a plant 

^i"5fiS#J#0>fN&% - £>l£M body which has the variation about control of a 

W.frbfD^WR&Afc^vyM-W. morphogenesis, the method of isolating the 

ft, St^^^^it^^ O^Jffl gene of this invention from this variant, and the 

ft£M(-tftW1"-5o ft> DN directions of the gene of this invention. 

AW^Olr, illg, MKU^, it In addition, general procedure required gene 

fc=?-(Di&mm<D$kfe. '^"7 recombinant, such as cutting of DNA, 

y #4 if — i/ 3 Ix^itfE connection, transforming, a decision of the base 

: f-*&*;Lfc&g&#ifc»i, sequence of a gene, and hybridization, it is 

f^(d^ffli-5rfj|5<Z)^^[c^ indicated to the description attached to the 

H & ti X & Vfi 9 ^ * enzyme of marketing which it uses for each 

Molecular cloning (Maniatis T. operation etc., and Molecular cloning (Maniatis 

et aj. Cold Spring T.et al.Cold Spring HarborLaboratory Press). 
HarborLaboratory Press) IdfEife 

[0 0 18] [0018] 

< 1 >fc&Kr>i&teM&&fflffl~r An isolation and identification of the gene which 

5 jifS^co^-gl • lU^ controls the morphogenesis of a <1> plant 

(1) J&MB&.toWlffliz.ffl-fZ) (1) Production of plant body which has variation 

|£M£W"f -SttrtfefrOf^Ul about control of morphogenesis 

flt^j, Wttf^cef^t^t© In order to make the gene which controls the 

^ffiJBfifc&fMW'f'Stt^fcaE morphogenesis of a plant, for example, an 

g%f&^ £i±-5(cfi, ^3£ji{S Arabidopsis thaliana, cause variation, it 

T-^IiiWSCiAl, transduces a foreign gene into a plant cell, by 

DNA (cf^A cftoiiCio making it insert in a chromosomal DNA, it uses 

f , #A»»ifi^^Wt the mutagenesis method (gene disruption) 

33£Htfl3Bi£ (min^i&M) & which destroys the gene of the insertion site, 

fflv^o ilffl-Ct 5iH5^-#A As an applicable gene-transfer method, the 

SfefcLTfi, T^n^^f li"? procedure of using an Agrobacterium, the 
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A&ffll^3#i£ J $\ Ift/n F electroporation method with respect to the plant 

-fy-^ MMSfc*t"f*5*l''* Y protoplast cell, the poiyethyleneglycol method, 

p tK— V— > a /^!Jif the microinjection method, etc. are mentioned. 

]) =1— -?4>7uy( The procedure an induction of the height of 

Vi?*.? i/B yflxfcif&lglf h transforming effectiveness and variation other 

tbS 0 Z.tlb<DjJW;<D 5 *s than the variation by a gene transfer uses an 

uyi%-f%-f KftLXitMUfe Agrobacterium from few points to an 

$:?)}^<DM£ mfe^MAl^ Arabidopsis thaliana among these procedure 

kZ^SlfttD^fDm&fr'P are effective. 

[0019] [0019] 

s^^l-V —^<? 9— Here, it is a binary vector type (T-DNA which 

^ (ffiMJJ&i-DNAJ»ARlfE can carry out a DNA transduction at a plant 

/iT-DNA, *lirafc£r0>lR cell). 

^.W^W^iWiWS^^ R It transduces a foreign gene into a plant cell by 

T/#?£ L< f*ffi#J*IBJ&*fcfil& the Agrobacterium infecting method using the 

^^IfflflScoiltRfflco-r— ^7— it replication starting point which can function by 

fc^&lstf^f £rfflv microorganisms, such as an Escherichia coli, 

tzT ^n/^f!i ^A^^&f- and the vector type which preferably contains 

ioT^-^itfS^-^IS^ilfflS&t- the marker gene for a choice of a plant cell or 

|AL, &MBf&%ffl'01-Z>m the microorganisms cell, how to produce the 

fc^M-f saSJWMfcfcftslW" variation plant about the gene which controls a 

-5^ffi^^1~o morphogenesis is shown. 

[0 0 2 0] [0020] 

i/n^f ?fXt (Arabidopsis To the wild strain of an Arabidopsis thaliana 

thaliana) ©If&HcK^ Ti 7° 7* (Arabidopsis thaliana), it is a binary vector 

^ Y^%co/<4 •)-]) — P— derived from a tumor inducing plasmid (as a 

(-v— 1j — i: LT, #|x.fiV^ marker, for example, it has a hygromycin 

jfu-rj v^fttiitfE-i^Rt/T resistant gene and an ampicillin-resistance 

y t°-> y ^imafe^^^Ti" gene) 

5)^,-rv7°7>^(in planta) You make it infected by the implanter (in planta) 

T if p * 9 7- V 7 Agrobacterium infecting method (Chang S.S., 

(Chang S.S., Park S.K. et al. Park S.K.et al.Plant J.5, No. 4 (1994)) etc. 
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Plant J. 5, No.4(1994)) 

fitt)£ p 3/ ^ V—Mz.&m IT 

(erecta m&m &uitmm 



It gathers the seeds after about six-week 
raising. 

It seeds the obtained seed to the agar 
containing a hygromycin, it makes it grow. 
It transplants to rock wool the transforming plant 
in which hygromycin resistance is shown, and 
raises it, the elongation state of a stalk chooses 
visually different variant (erecta variant) 
compared with a wild-type plant. 



[0 0 2 1 ] [0021] 

r o LX&bfriZMI&iM, fit The variant obtained by carrying out like this 

%<DMBMl$.&fflffli'&Afc7- has high possibility of having varied when a 

\z.y<-< 1- V ? ? — # s !f A"f binary vector inserts in the gene which controls 

Sr. b\z.&oX$E&LX\/'*Z>*I the morphogenesis of a plant. 



[0 0 2 2] [0022] 

( 2 ) j&ffl&fR^to&MB? (2) Isolation of variation gene of form gene 

£>^|§ From the variant from which the form obtained 

±&<D& o \z. LXftbtiZBM by performing it above varied, it isolates the 

(D^ikLtclg.gifcfrbs variation gene in connection with the variation 

^mfrZ^MMfc^Z, v^fr what is called using a plasmid rescue method. 

$5 7°-7 $ Ku^df-a. — ffi^r That is, it prepares a chromosomal DNA by Cell, 

JB^T^it-f 5o 1"^t>*>, £ the procedure of 35 (1983) p.35, etc. from a 

J&UyO'lfcfrb Cell, 35 (1983) variation plant body, and cuts by a restriction 

p.35 fEicW^ffi^T^fe^D enzyme, it connects intramolecular terminal by 

NAZmmLXMmmmX'Wm self ligation. 

U t^7 7>fy-V3 If the obtained circular DNA contains the binary 

0 5>^ftc/)*ffilRl±£ii#p"t" vector inserted in the chromosome, this DNA 

-5c #k*i/fcgfcl£DNA# % ^ molecule will function as a plasmid which can 

&ffc(c}f A^ft/c^-f -J-y — ^ carry out autonomous reproduction in the 

9 9 — £r-a V ^tU3\ :©D Escherichia-coli cell, a transformed body shows 

NA^fiAMM-TrgM marker medicine (for example, ampicillin) 
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[0 0 2 3] 



[0023] 

It transforms an Escherichia coli by the 
above-mentioned circular DNA, by collecting 
recombinant plasmid DNA from a resistant 
transformed body to a marker medicine, it can 
obtain the chromosomal DNA fragment which 
contains a form regulatory gene with a binary 
vector. 



[0 0 2 4] 

!J-/1^T-DNA^d- 

£ W-f 5 * n - * £ 5 r. 



[0024] 

Or it prepares a chromosomal DNA from a 
variation plant body, it connects with a plasmid 
or a phage vector, after cutting by a suitable 
restriction enzyme, it produces a chromosome 
library by transforming an Escherichia coli now. 
Also by selecting a clone by using T-DNA etc. 
as a probe from this library, it can choose the 
clone which has a variation gene fragment. 



[0 0 2 5] 

P 1 7 



[0025] 

(3) Isolation of wild-type gene of form regulatory 
gene 

From the Arabidopsis thaliana of a wild type, it 
produces a chromosomal DNA library using P1 
phage vector etc., by the hybridization which 
uses the variation gene fragment obtained by 
performing it above as a probe, it can isolate the 
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t i~ 5 W rf y ?4 J i— is a V wild-type gene of a form regulatory gene. 

(cJzoT, MMUWMBl-om Moreover, based on the base sequence of a 

£Ma{S^-£M1~'5-£^tti variation gene, it produces a primer, also by 

£fc, «MW4&S amplifying a wild-type gene from the 

gfi^ij^S^V^"C7°7-Y-^— £{1= chromosomal DNAof a wild-type plant by PCR, 

§£U PCRti 1 ? W^MMtfo the gene of this invention is acquirable. 
(D^&fc D N A h W^MMfc 

3 r t iz. x o x h > 

[0 0 2 6] [0026] 

^temSfilTlt^tLfc^n-l'^ The base sequence of the DNA fragment 

l-X^mZMMW-AfeT-^te containing the form regulatory gene of the 

DNAUftjt<7)t&SK£'J£, Ifi^'J Arabidopsis thaliana obtained in the 

^iS?iJ#-^- 2 ItZTfk-t 0 ^(DjlfH after-mentioned Example is shown in 

iHi, 2 7 fl§(£>3i ^ y :/ (exon) sequence-table sequence number 2. 

t 2 6fM<D4 y h n y (intron) This gene contains 27 exon (exon) and the 26 

£-aX"trl % 5 0 introns (intron). 

[0 0 2 7] [0027] 

±12 (D J: 5 \z. y ^IS^cDjtfe^f- As mentioned above, the gene of this invention 

H#fcco^ yho ^ A^T*^ contains many introns. 

5 o ^ yy^\ What is sufficient is just to isolate cDNA of a 

mtnfcWMf&ZM'MI-Z?^^ form regulatory gene, in order to obtain the 

^^-KtSDNA^IIS exon part, i.e., DNA which codes the protein 

tr.fi, BWMffiAfc??) c D N which controls the morphogenesis of a plant. 

A^M-ffrff cDNA A cDNA library extracts mRNA from the 

y^Zfy V—fe* ->P^f?tX above-ground-part organization of an 

1-(DM±.%V,%Bi$ifr fjmRNA?: Arabidopsis thaliana, it compounds cDNA using 

ffiffiU iSls^^StSrffl^T c a reverse transcriptase, it inserts in a vector 

DNA^fiEU 7Ky>7— ^ what was double-strand-ized according to 

Sftl-ioT 2^m.iklstzh(0 polymerase reaction, it is producible by 

£^ l, n ^B§^^ transforming an Escherichia coli etc. 

^IffefttS:^:^^^ Since the cDNA cloning kit is marketed, it is 

tS^t^t'^o cDNA^ sufficient to use these. 
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u—=-y?*y h^rtnK$tb"C It obtains a form regulatory-gene cDNA clone 

t^5©"Ci*lbSrftffiLT'bJ: from the obtained cDNA library by using a 

V \ # b tL7t c D N A y -Y 7" 7 chromosome gene as a probe. 

-y<h LTllfflMef c D 

[0 0 2 8] [0028] 

±f£Oj; 9^L-C^faHffifiJT" The base sequence of cDNA obtained in the 

'&btitz c DN A(Dig£H?iJ, after-mentioned Example as mentioned above 

&t>*~ <£>ifcX@c?iJ £*i and the amino acid sequence presumed from 

57 * j ^Ifi^iJSr, Sfi^iJSSS^'J this base sequence are shown in 

1 (CTF-fo ^Witfc^F-Wffl sequence-table sequence number 1. 

IRgi^Jfi, Nature, 345 (1990) The translation production of this gene showed 

p.743 (dfeit^ftTV^ RLK5 RLK5 and homology which are indicated to 

irffiRttSr^Lfc. RLK5 Nature and 345(1990) p.743. 

ite^-fi, MSLbl-ttl'S This RLK5 gene was isolated as a protein 

ut^-iro/pf-fy^t kinase like a receptor which exists on a 

— Mb bT^lH&ftfcJ^ cytoplasmic membrane, 

coy p =r4 y^-f—HMfc^O However, homology with the base sequence of 

&£IE?iJ<i:<£>ff iRlttKlJ: 9 a known protein-kinase gene isolated, and the 

£titch<DX\ ^(DitLfcT-^M gene, function of a translation production, etc. 

RRH^WttUBft blt£<'ffflX- are completely unknown. 

fc5 0 :o RLK5 iH5-7-<Z>36!S The expression pattern of this RLK5 gene 

— yfi, Mffl&tiHDfflftfc differs from the gene about control of a 

p-f Safc^fiMft 1 ^ iffi-t morphogenesis, it expresses not only by an 

< > la-CkSSH LT above-ground part but by the root, and both are 

*3 0 Mtfj fc functionally considered to be a different gene. 

[0 0 2 9] [0029] 

< 2 > ^IHf&JflPJUS^f-tf^Jffl Utilization of a <2> form regulatory gene 

&%W<Dfefc?\t. fflto<r>W& The gene of this invention is a gene in 

J&&.<Dfflffl. <&l^<Di%M\zM connection with control of the morphogenesis of 

^-5ilf5^-T'fo 9 , w Wjt^T- a plant, especially an elongation of a stalk. 

"eWtt&J8WE!fcU It transforms a plant with this gene, it is 
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<D&i%-Mkt:i%tiU&'&Z>Z.bK£. anticipated by making the expression level of 

ot, ^©#flrSrfJEiiT*§ Sr. this gene increase that it can promote an 

ft 5 o elongation of a stalk. 

[0 0 3 0] [0030] 

Srffl V^TffiW What is sufficient is just to transduce DNAinto a 

TI0ftlE8W" i 5fcfi% Yv protoplast by procedure, such as procedure of 

zftu— i/av (fE^#j|p?L?£) utilizing the tumor inducing plasmid of 

fo-SVMiT^o/^x !i •> A(D electroporation (the electric drilling method) or 

T i 7° 7 * 5; K^^Jffli"^>^& an Agrobacterium, in order to transform a plant 

^MftCiot, 7°n using the gene of this invention. 

7* M-DNA^^Ai"ti(i <fc In that case, as a gene of this invention, it may 

\t\ ZOB, &36W<Ofefc : f-k use a chromosome gene and may use cDNA 

LXit, ^MfS^fflV^T prepared from mRNA. 
fcJ:V*U mRNAA^ISL 
fee DNA%m^Xh£\<\ 

[0 0 3 1 ] [0031] 

— ^ ^^^(DMfc^ RNA which has a complementary sequence on 

ffl]$iJi"5 T yf-± R N A, the other hand in the full length of the antisense 

i~t£hhMMfflffli&fc s irfr c D%z RNA which controls the expression of the gene 

^£ft<5mR N A(D±-g:X\tZ of this invention, i.e., mRNA transferred from a 

<D>pt£. < b k-£lU-ffiffilftfria form regulatory gene, or its at least one part, by 

^iJWSRNA, i k~*&&t%> transforming a plant by expressing DNA, it is 

DNAt<fift^iS!teito- anticipated that it can control an elongation of 

b \z X V > J^SffelfeflS^W^® the stalk of a transforming plant. 

[0 0 3 2] [0032] 

Ty'f-tyxRNA&ft^T'b DNA which expresses an antisense RNA is 

DNAfi, Tyfty^i (-t obtained by connecting an antisense strand 

y*M{^— KfcOfctllifitofcJfi (strand which has a complementary base 

g|B£iJ£Wi~'5#0 Xtt'XC < sequence in a sense strand (coding strand)), or 

b h : t(D—^^-f^^—^—<D an at least one part of that downstream from a 

Tmz-Wfe-fZ w i K: «t 0 # promoter. 
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m 0 n^&*-tit£s 

fcEJ&J £^tr 2 « D N A £ , 
« w^(D|p] § i iSS [r] # L 

#W«(^P^> mRNA© 

3' mmztttiazftztfv Am 

M/i*!) dUSr = -Ki" 



In other words, it is obtained by making into the 
direction and reverse direction of original 
transfer double-strand DNA including a 
sequence homologous to the coding strand or 
its at least one part, and connecting it 
downstream from a promoter. 
In addition, an antisense strand is also obtained 
from a chromosomal DNA or cDNA. 
However, when using a chromosomal DNA, 
since the intron cannot anticipate achieving the 
function which controls the expression of this 
invention gene, it is desirable to use the exon 
part. 

Moreover, as at least one part of an antisense 
strand, it can use either a coding region 5' 
untranslated region or 3' untranslated region. 
Furthermore, in addition to 3' untranslated 
region, DNA which codes an antisense RNA 
may include the sequence which codes 
complementary poly dU in the poly A chain 
added to 3' terminal of mRNA. 



[0 0 3 3] 

*mW \z&m UIS 7°p 9 
— t LTlis CaMV 3 5 S 7° 

DNA ZMWfc&'t -5 T* 



[0033] 

CaMV 35S promoter etc. is mentioned as a 
promoter who can use it for this invention. 
In order to transforms a plant by DNA which 
codes an antisense RNA, it is sufficient is to 
make it similar with transforming with the gene 
of this invention. 



[0 0 3 4] [0034] 

£ t>lu, 2Mi§li<E>jH£^£>=»— Furthermore, upstream of the coding region of 
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KW£<£>±i)ft(-te> #3§^<£>it the gene of this invention, the region which 

&T(D$£$L$:ffl'MirZ)Mt&tfl3 controls the expression of the gene of this 

£ft5 0 - <DW$<t LX\%, Sfi invention is included. 

£|J#-^ 2 cd±^S#-^ 1 ~ 1 7 5 The region which contains at least one part of 

2 T^^ixSIB^'J^^ft < t h the sequence expressed with the base number 

— gG^-g-ttfi^ i^M^I- 1-1752 of sequence number 2 as this region, 

(4t&£#-£r3 96~1752<D the region of the base number 396-1752 is 

W bfo&o ^ (D ffl$>\*. mentioned more specifically. 

fitttfflJJS^ &v ^TitiH^-O^^ it is anticipated that this region will be applicable 

MMz.Mffi-fZ) :H 5 "Ci5: to expression control of a gene in a plant cell. 

[0 0 3 5] [0035] 

immrn) [examples] 

i^T^. 3£3§f^£rHJ&W- £ "5 Below, an Example demonstrates this invention 

$ P, i^fl^icfftf^l-So still more concretely. 

( 1 ) ff2fiffM<7>ftJW-M1~5 (1) Production of variant plant about control of 

ggMflMSftfeKPfNK morphogenesis 

v/n^f^fXf (T^f'KT'v' It infected 101 strain of Agrobacteriums EHA 

X ) 3 ^ 7° WS with pGDW32 (Agrobacterium tumefacience) 

( Wassilewskija N LEHLE which is a binary vector (having a hygromycin 

SEEDS ttJ: "9flf A) t£t-> Ti resistant gene and an ampicillin-resistance 

-fy 7. 5 K&3tt<7VW "t" y — ^ gene and autonomous duplication is possiblein 

^ ^ — (/Nxf yu-?y( f y vWk an Escherichia coli intracellular ) derived from a 

Mfc^AVTyt^yV ^ftttit tumor inducing plasmid to an 

-fKi^^U A)]§Mll£ft"C*g Arabidopsis-thaliana (Arabidopsis) ecotype WS 

#:1I$i«If£) pGDW32 strain (purchased from LEHLE SEEDS Co. 

^o7^o/^fU £ Wassilewskija) by the implanter (in planta) 

EHA101 ffi. ( Agrobacterium Agrobacterium infecting method (Chang S.S., 

tumefacience) -f ^ Park S.K.et al.Plant J.5, No. 4 (1994)). 
^ (in planta) 7^p/^t!) 

(Chang S.S.,Park 
S.K. et al. Plant J. 5, 
No.4(1 994)) \z. «fc "9 JS^ $ -frfco 
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[0 0 3 6] [0036] 

ftftftdfi, £kT<D J: 0 £ Specifically, it carried out as follows, 

tfofco pGDW32 LfcT It proliferated the Agrobacterium which 

^d/^t!)!)A?;, fct£^W transduced pGDW32 overnight by LB broth 

(/N^f ^ p -r^ i/^) £<a-£j> L containing antibiotics (hygromycin). 

BMfct£i&X—ffi.i%M£'£tc 0 It diluted this culture-solution 10 microliter to 

r Wi|^if$ 10m 1 £ G a m b Gamborg's B5 medium (2% of sucrose is 

org' s B 5 igtte ( y a included and it is pH5.5) 190 microliter (1/20). 

2%£-a<^ pH5. 5) 19 It performed Agrobacterium processing 19-23 

0 u 1 (1/20) Lfco If days afterward from seeding in the growth 

1 9~23Hi, phase which just began the bolting. 

[0 0 3 7] [0037] 

f^t/^fefcfi^^ W^E^^^W In the base, it cut off the scape which just began 

g^c&V^T 1 1 -^M ^.^fflv^ to be extended using the No. 11 scalpel, and 

9 Ex ^ 26Gxl/2t penetrated the center section of the rosette 

4 X(D£ft$t3rffl^"C#J !)P^ stalk from the cut end using the injection needle 

g>p-tfy h%:<D$>$:%f>ZMM& of 26G* 1/2 size. 

-Br/Co w©i$n \z.T?nWT It implanted Agrobacterium dilution-liquid 1 

V i)AfWl /z 1 SrttAL microliter into this wound. 

fc„ 3 0 0 0 ~ In the meantime, illumination intensity was 3000 

400 0yV^^|:Lfc o i!3 -4000luxs. 

B^icfil^i^P ^^77^^- It transplants a plant to the mini pot of a lock 

(D * *j h fc^fit t \ -t £>t& fiber three days after a vaccination, it performed 

(iiif-coist^^ffo /Co 88lJ&» the usual cultivation after that. 

6 illBHiL -?-< JfM^tifc It harvested the seed from the ellipsoid of the 

^^^(DMi^^M^-^^LW. t about half formed early about six weeks after 

tz 0 the vaccination. 



[0 0 3 8] [0038] 

'&btitcM^-&. lOjug/ml (D/n It seeds the obtained seed to B5 agar 

J i/y%1£ts B5 M% containing a 10 microgram/ml hygromycin, it 

i^iHttlL, fiS22°C, ilg made it grow the optical period of the light 

30-40 %, MM 6000-12000 jv period 12 hour /, and dark period 12 hours 
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^^©SWTt, ijyW 12 B#|S]/ under the temperature of 22 degrees C, 

Bf$ 12 B£|BI<D}te^l$W"e£W& humidity 30-40 %, and illumination with an 

-frfCo ^ft^f± 1/1000 (vi#|R illumination intensity of 6000- 12000 luxs. 

L/c HYPONeX (tt±t)E OHO The nutrient used HYPONeX (Make from village 

SI) ^fflv^o /\j 4fwr4*/ choice goods) diluted to 1/1000. 

y ft £r ^ i~ pGDW32 <Z) It transplants to rock wool a transforming plant 

T-DNA if AlE^J&r&o^ffls with the T-DNA insertion sequence of pGDW32 

%MM^: o y ^ L which shows hygromycin resistance, and raises 

-CWJ$ U T4 ffi^ (jg 4 1£ftcD it, it collected T4 seed (a 4th generation's seed). 

H^-) ££HxL7c 0 U Under the present circumstances, it performed 

%m^M^T^\ ^(DW&W: the search visually and obtained the variant 

^ftW&SLWfa^VG^XgktlZ) from which the elongation state of a stalk differs 

^M^Sr#fe 0 compared with a wild-type plant. 

[0 0 3 9] [0039] 

( 2 ) ^^itfs^-co^il (2) Isolation of variation gene 

T 7 t" K ^> <D f J J* DNA It prepared the genome DNA of the Arabidopsis 

Cell, 35 (1983) p. 35 [dfSic by the procedure of the statement etc. to Cell 

(Djj ; mmz£ymmLtz 0 and 35 (1983) p.35. 

[0 0 4 0] [0040] 

-80°C-C$C^ Lfc7 7 hf K7*v' Arabidopsis organization (root) which froze in 

XffEM. OK) 5g ¥L&£ffl -80(degree C) It pulverizes 5 g until it becomes 

\i^Xifcfc&M t P"?Wtfftttzt£Z) a fine powder in liquid nitrogen using a mortar, it 

£-e$)#U ZLtl% 25ml CD is 25 ml about this. DNA It adds and agitates 

DNA lWUffi^ '777- (50mM the buffer for isolation (50 mM Tris-HCI, pH7.5, 

Tris-HCI,pH7.5, 0.2M NaCI, 0.2M NaCI, and 20 mM EDTA-Na2, a 2% 

20mM EDTA-Na2, 2% N- 7 N-lauroyl sarcosine sodium salt, and 3 g/ml 

p -f ;Hf aJ/ytMJ^ urea and 5% TE saturated phenol), 1.5 ml 10% 

t&, 3g/ml I<St, 5% TE f&fPv 7 SDS (sodium dodecyl sulfate) after adding a 

— /u) ZJNtlXMW phenol/25 ml of chloroform In addition, it 

25ml ©7i/-;W7nn*^ agitated gently at room temperature for 10 

A £fln 1.5ml <7) 10% minutes. 

SDS(KfV;uiitM)^A) After centrifuging this by 6000prm for 10 

tr^P^T 10 #Bfk ^fiTlS-^ minutes and adding a phenol/25 ml of 

;H£if#Lfco 6000prm chloroform to an aqueous layer, it centrifuged 
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T* 10 tKJSU: by 6000prm again for 10 minutes. 

25ml J)7*y-^BB*^ Immediately atter adding and agitating 15 ml 

Afcjjp&fc^ ??£ 6000prm ethanol to this aqueous layer, it centrifuged by 

*T? 10 Jfri&>i}\stz 0 ^.(DfcMlZ- 6000prm for 10 minutes. 

15ml J — MW It throws away supernatant, it adds 25 ml 70% 

LtclkB.hfcs 6000prm f 10 ethanol to precipitation, and agitates by a 

#iS'i>L/c 0 ±?t£f£"t, itJS vortex, it threw away supernatant, after 

\z. 25ml CD 70%^ y — /uSr^n centrifuging by 6000prm for 10 minutes, and it 

x. X , Otsjvf- y^^ltZX'O MW dried under reduced pressure. 

U 6000prm "C 10 fti&<bLtc It dissolved this in TE of 400 microliter, and 

'&±.\n %&Xs W&fXM^. b pH8.0 (10 microgram/microliter RNase). 
fc 0 rtlfc 400 mI <D TE,pH8.0 
(10jzg//xl RNase) |i*#L 



[0 0 4 1] [0041] 

_LfE<D J: 9 LT, MMMf&(D It performs it above, it prepared from the variant 

WCitS'IIW^lSL about control of a morphogenesis. 

200 Mg GD^feffcDNA & % Chromosomal DNA of 200 microgram It 

gcV^T CsCI j®jfi'^jfe^«t t) continues. CsCI It purified by the 

flH&L/to ^<Doh, I nq (O ultracentrifugal method. 

Vk&W DNA £r EcoRI Xfi Among these, 1 microgram Chromosome 

Xbal t?3JBrU 7^7— /wffi DNA It cuts by EcoRI or Xbal, after purifying 

ttj, oi^y — /utt®-e$URRI?3f5 a restriction enzyme fragment by phenol 

ifJt^ffl^bfc^, t/i'7 7-l' extraction and the ethanol precipitation, it 

?—->3 9 ft connects intramolecular terminal by self ligation, 

|WI±£riI^U #btLfc^^tD it transformed the Escherichia coli (XL1-Blue 

NAXM^ (XL1-Blue MRF' MRP (from STRATAGENE to purchasing)) by 

(STRATAGENE fcfrbffiA) ) the obtained circular DNA. 

^ff^K^^bfCo ^(DffiM:, %} As a result, it was able to obtain the 

1000 3P--©7Vfv'!Jy transformant which shows the ampicillin 

iH4£7jrf ^Sf$5&t*£#5 w resistance of about 1000 colonies. 

ir^"Cf f-e tt^tLfcR'tt^n It extracted plasmid DNA from the obtained 

=-i^7*7^^ KDNA&rtt resistant colony, and conducted the analysis. 
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[0 0 4 2] [0042] 

±HE<£>«£ o tdLT ux%-=l — £ It named what was obtained from DNA from 

titzzfy Y(D?hs EcoRI which it cut what was obtained from DNA cut by 

-C^rjl^f Ltz DNA fi* h fitz EcoRI among the plasmids by which the rescue 

t>co£r pREa RXf pREb, Xbal was carried out as mentioned above by pREa 

XtyWil>tc DNA frb&btitc and pREb, and Xbal pRXa and pRXb. 
t©^ pRXa RXf pRXb t 

[0 0 4 3] [0043] 

( 3 ) Mi&i&fcl- <D^M (3) Isolation of variation gene 

±12 GO J; o\L\sXV7s*=l~- £ It performed the isolation of the variation gene 

thtc7°7 * 5 KX"a £ftTV^5 from a chromosomal DNA library by using as a 

%}&i$DN AWi)i$:7 p v — 7°b probe the chromosomal DNA fragment 

tt, MftDNA7-i'77 y contained in the plasmid by which the rescue 

—fr b <D^Wmfe I! r ( nW-W.%:ft was carried out as mentioned above. 

ofc 0 P 177-^^- It uses P1 phage vector (it purchased from DU 

(DU PONT tt/^fbliA tfc) PONT), and is The Plant Journal. It follows 

£ffli^T, The Plant Journal, the procedure of 7 (1995) p.351 and is 

7 (1995) p.351 di£it<7)#i£^ Arabidopsis ecotype. It prepared the nucleus 

L it & o X , T 7 t* K 7 s v- DNA library of the Colombia strain. 

in^^T* =*uy t'Tftco^ On the other hand, it does the 32 P label of the 

DN Ay 4 y y V — ZMWils EcoRI-Xbal fragment prepared from the 

tc a — # % ±|E pRXb £ pREa above-mentioned pRXb and pREa, 

b % ti -t* ti M M t fz respectively, it performed the plaque 

EcoRI-Xbal WrK £ 32 P WM L , hybridization by making these into a probe. 

^tLb$r7°n— 7*i: LT, ^7 As a result, two positive clones are obtained, it 

— ^/N»f 7 >J #4 if — a y£ named it 28D7 and 61H10, respectively. 

fTofc 0 ^W^H, 2o(0SH4 It performed production of the restriction 

^ n — isfrftbtis Zti^ti enzyme map of these clones, and it turned out 

28D7, 61H10 Ltz 0 ^ that about 25 kb(s) and 61H10 have an insert 

txb<D? d — ^©ftiJRBS^SftfeEl derived from the Arabidopsis chromosomal 

^©££^,2807 fim 25kb, DNA of about 75 kb(s) 28D7 (FIG. 1). 
61H10 \tm 75kb <DTy t" K7° 
is * D N A £ $ <£f? AWr 
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[0 0 4 4] [0044] 

£ &Xlfl?'J#:£<£>fc#>t£ Furthermore, it implemented subcloning of the 

^ADNABfJt^SHSSS^ variation gene contained in an insertion DNA 

itfs^ © f - 7' ' ^ p — ~ ^ ^ £ H fragment for a base-sequence decision. 

M Ltz 0 Z <DU, T-DNA £Bt& Under the present circumstances, it implements 

irZM&l<Dffl$BB^i&Mk, W the restriction enzyme map of the sequence 

A Wit CO f - if yfi?#f £ Hffi L » which adjoins T-DNA, and Southern analysis of 

f?A®riT±£> T-DNA W A^fi an insert, it presumed the T-DNA insertion site 

ZmfeLtc (0 2). rco^A^ on an insert (FIG. 2). 

{fiWiff#©lte&#DNA61>FiJ£: It subcloned the chromosomal DNA sequence 

Bluescript II SK+^f-^^ o — near this insertion site to Bluescript II SK+. 

=-yfLtz 0 ibHfcWo The position on the insert of the obtained 

— XDffi AWrJt±tf>&*£EI 3 subclone is shown in FIG. 3. 

[0 0 4 5] [0045] 

(4) JF^ff^co$iJ^(cM1"-5 (4) Isolation of cDNA clone of gene about 

j1{e-7-cd cDNA 9 v — y (DWW. control of morphogenesis 

T7 \f Y'f'sx xn^^T* Arabidopsis ecotype It prepares mRNA from 

nnyif7i©I±W^^ the above-ground-part organization of a 

mRNASrlUKL, Molecular Colombia strain, molecular cloning (Sambrook, 

cloning (Sambrook, J., Fritsch, J., Fritsch, E.F.and Maniatis, and T. (1989)) 

E. F. and Maniatis, T. (1989), A A Prepare cDNA by the procedure of 

Laboraroty Manual, second Laboraroty Manual, second edition, Cold Spring 

edition, Cold Spring Harbor Harbor LaboratoryPress, Cold Spring Harbor, 

LaboratoryPress, Cold Spring and NY, it produced the cDNA library by making 

Harbor, NY) W^SCJ: 9 c D (lambda) YES (it having purchased from 

N A £ m fit t , I YES CLONTECH) into a vector. 

(CLONTECH ft^biAL On the other hand, it cuts out the sequence 

tc) H^?-i:L"CcDNA which adjoins T-DNA from said pRXb and 

y^y'y !)-4fl8Lfc.-*, pREa, it did the 32 P label of this. 

itufE pRXb t pREa b T-DNA It performed the plaque hybridization for 

^m^ir^^m 9 ttl U r 300,000 plaques of a phage library by making 

*i£ 32 P«L/t 0 these into a probe. 
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d — t Lt> 77-^7^^ In this way, it set the selected plasmid to 

7 V-<D 300,000 pKUT161. 

iIWLfc7°7*5 KSrpKUT 

[0 0 4 6] [0046] 

(5) JK1!B^fiKoiW«i^K1"5 (5) Analysis of gene about control of 

jkfcTOMffi morphogenesis 

p K U T 1 6 1 \zS t ti 5 c D It decided the base sequence of the cDNA clone 

NA^ p-yi^feftitfH^-W contained in pKUT161, and a chromosome 

tfti£E?!l&*5£Lfco cDNA gene. 

Wi&SIB^J&tFrwia^iblf The base sequence of cDNA and the amino 

££tL-57 5: /^Sa^iJ^lB^J^ acid sequence presumed from this sequence 

1 are shown in sequence-table sequence number 

SId^J^ia^J#-^2 (c^-fo Moreover, the base sequence of the DNA 

fragment containing a chromosome gene is 
shown in sequence number 2. 

[0 0 4 7] [0047] 

cDNA <D±£gIfl£iJ/^tl;££ti When the analysis of the amino acid sequence 

5 7 3. /S?ia?iJ £?Tofc presumed from the base sequence of cDNA 

t^h, rcDjffc^f-cDHIRlI^ was conducted, the translation production of 

Nature, 345 (1990) p. 743 this gene showed RLK5 and homology which 

^Eic^Htv^ RLK5 are indicated to Nature and 345(1990) p.743. 

14£^Lfc 0 * go RLK5 jftfc^- This RLK5 gene was isolated as a protein 

it, M&%M±.lz?£fei- Z> kinase like a receptor which exists on a 

^— $1 CQ7°n7M — ^ i: cytoplasmic membrane. 

LT^i££;ft/i#\ UE£a<7)7°p However, homology with the base sequence of 

V$c-Y— If itfe^^^Sia a known protein-kinase gene isolated, and the 

?|J£ cd*B|rH4[c: £ *9 fctz gene, function of a translation production, etc. 

fcrot, -eoojl^^-^ffllR^tJ are completely unknown. 

ro^ft^te^^W"^^ The expression pattern of this RLK5 gene 

r<D RLK5 fifc^-^lgJl/^— differs from the gene about control of a 
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Xi, ff^W©Bi(-lt^ morphogenesis, it expresses not only by an 
itfc^- t It&te 9 , i&±%$tcli above-ground part but by the root, and both are 
"Cft < > fittSI 0 tit functionally considered to be a different gene. 



[0 0 4 8] 

(6) temBf£<nMWKWtZ> 

pie (i) x*mbtitcBnmf& 

# ft ft IBS ?> , Landsberg 
erectaftw^u^^fg^^lll- 

nfco * r. -? * -f\ 5fe<^ 

Landsberg erecta er-103 
er-104»£ L/io 



[0048] 

(6) Analysis of variant about control of 
morphogenesis 

The variant about control of the morphogenesis 
obtained in said (1) was considered to be the 
variation of the same locus as the erector 
variation of a Landsberg erecta strain from the 
elongation state of the stalk. 
Then, it did first the hereditary complementary 
examination of the variant isolated by a 
previous Landsberg erecta strain, er-103 strain, 
and this invention. 

It becomes clear that all of these variation 
originate in the variation of the same locus as a 
result, it made the isolated variant into er-104 
strain. 



[0 0 4 9] [0049] 

Landsberg erecta tt> er-103 W It isolates mRNAfrom a Landsberg erecta strain 

j^fjmRNA&WHIU RT- and er-103 strain, it amplified by the RT-PCR 

PCR (®^?-PCR) feT* (reverse transcription-PCR) method. 

JtMBLfco "f"ftfr*>, mRNA That is, let mRNAbe a casting mould, let cDNA 

%$%Mk @fi#l#-PM iZTTk-f- which performed reverse transcription reaction 

tiUgMW&fti'Z*}) and was obtained continuously be a casting 

irf- KSr^-f t LTigflS mould by making into a primer the 

^KJ£$rfTV\ ^v^-C#btifc oligonucleotide which has a base sequence 

c D NA^iS <b L, Btiia^" y shown in sequence number 4, it used for the 5' 

=*5* * U'^-f- KSr 3 ' #J7°7-1' side primer the oligonucleotide which has a 

T>WJ#-^ 3 {d^-fifeS base sequence which shows the 3' side primer 

&I?iJ&TW1"'5^y =*5<^ i^f- and sequence number 3 said oligonucleotide, 
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KSr 5 ' 4 ^— Mffl V N T and performed PCR reaction. 

PCRKJ&SrfTofc. 

[0 0 5 0] [0050] 

^htitci^^M^O^^MtL, Let the obtained amplified production be a 

±IS^" !J =""3* ^ i/^-f- K7°7 4 casting mould, by the direct sequence method 

v-Sri^fc^l^^ FV^^ using the above-mentioned oligonucleotide 

^^&(- J: 0 > i££Sfi£'J£-$::£ primer, it decided the base sequence. 

Ltz 0 -t W IS ^ > Landsberg As a result, on the Landsberg erecta strain, the 

erecta tsfctf (4iE?iJ#-§- 1 <£>^S isoleucine residue of the amino acid number of 

2299 #£>T/&SAlc:g|fcL No. 750 had replaced by the lysine residue in 

fc<£>K:#V\ T 5 7 750 connection with T of the base number of No. 

^(04 y p 4 i/yfl&fr V i?y 2299 of sequence number 1 having replaced by 

aSl-S^LTV^c 0 er-103^ A. 

■^(iga^iJ^SB^JS^- 1 Wi^SS In er-103 strain, when G of the base number of 

^ 896 #(7)G^A icg^-f 5 ^ No. 896 of sequence-table sequence number 1 

t £ <9 ^ 7^/ g£#-^- 282 # replaced by A, the methionine residue of the 

(D y ^-y>r=- yn-fi/ amino acid number of No. 282 had replaced by 

^SUEt-ff^ Lt^fcc the isoleucine residue. 

[0 0 5 1 ] [0051] 

^ti^±X<DMM^X\ MM It becomes clear that a part of base sequence of 

Mf&(DM'i®^M1-Z>mfc?(DiM. the gene about control of a morphogenesis has 

Slfl^Jtf)— T^y^IB^iJ the variation which causes a variation of an 

O^ikZm # o amino acid sequence among all these variant, it 

r i: flSJpJH! U r (DMfc^fcM has identified that this gene was a gene (form 

mftM*®mi-Z>Mfe7- (MM regulatory gene) which controls a 

MWAin^-) -efc2>r bftfilfe morphogenesis. 

[0 0 5 2] [0052] 

( 7 ) M^it^cD^^? (7) Expression analysis of form regulatory gene 

#f It made the comparative analysis of the 

±tZ?>& b (-Lt^i^tif: expression-level analysis by each tissue of a 

cDNA % &XfK%k&fcWifc z Ftf variant plant, and the variation of the expression 

MBMf&ZM'M-fZi&fc^X'h level of the gene between a wild-type plant and 
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fc 0 Landsberg erecta ^=&Vfh 
®mAfcT<7)%$L (mRNA <D± 
H f 5 <t A/ ^"SB ft b flit a>o fc c 



a variant plant by Northern analysis as an 
another method of confirming that they are 
cDNA isolated as mentioned above and a gene 
by which a chromosome gene controls a 
morphogenesis. 

It used the cDNA clone for the probe. 
By the wild-type plant, to being observed 
strongly, the expression (formation of mRNA) of 
the form regulatory gene was almost accepted, 
and swarmed at the variant from the result of 
the Northern analysis using the Landsberg 
erecta variant and the whole solid of the 
wild-type plant. 



[0 0 5 3] 

iHS* a* £ <o w ft \z. M t> z> it& 



[0053] 

Moreover, it conducted the expression-level 
analysis of the form regulatory gene in each 
tissue of a wild-type plant. 
There were many expression levels in a stalk 
with a remarkable variation of a character, a 
flower, etc. at the variant, and the result 
confirmed also from this that this gene was a 
gene in connection with an elongation of a stalk. 



[0 0 5 4] 

7 5 2fi$ 



[0054] 

Furthermore, as above-mentioned, since the 
expression of a form regulatory gene is as 
stage-specific as a part, the expression 
controlled domain of the gene of this invention 
is applicable to control of the foreign-gene 
expression in a plant. 

The region which contains at least one part of 
the sequence expressed with the base number 
1-1752 of sequence number 2 as such an 
expression controlled domain, the region of the 
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f 3 9 6' 



1 7 5 2 cDf 



base number 396-1752 is mentioned more 
specifically. 



10 0 5 5] 

-fe R N A &m$L~f 5 D N A 



[0055] 

[ADVANTAGE OF THE INVENTION] 

The gene which controls the morphogenesis of 
a plant by this invention is offered. 
It is anticipated by increasing the expression 
level of this gene that it can promote an 
elongation of a stalk. 

Moreover, it is anticipated by transforming a 
plant by the DNA sequence which expresses 
the antisense RNA with respect to this gene that 
it can control an elongation of a stalk. 



[0 0 5 6] 

mm^r : 1 

ffi?iJcDft£ : 3176 

mtwrn : mm 
mo& : 

%m<DUm : cDNAto mRNA 



[0056] 

[SEQUENCE TABLE] 

Sequence number: 1 
Sequence length: 3176 
Sequence type: Nucleic acid 

The number of strands: It is double stranded. 
Topology: Linear 

Type of sequence: cDNA to mRNA 
Origin 



£ $9 £ : -> a 4 % 1- X -f Organism name: Arabidopsis thaliana 



(Arabidopsis thaliana) 

: nnyt'7 

wmzm-ttz^ : CDS 



(Arabidopsis thaliana) 
Strain name: Colombia 
Sequence characteristics : 
Symbol showing the characteristics: 



CDS 



SSfii : 51. .2978 



Location: 51.2978 
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CTTTTAAAGT ATATCTAAAA CTTTTAAAGT ATATCTAAAA ACGCAGTCGT 
ACGCAGTCGT TTTAAGACTG TTTAAGACTG TGTGTGAGAA ATG GCT 
TGTGTGAGAA ATG GCT 56 
56 Met Ala 

1 

Met Ala 
1 

CTG TTT AGA GAT ATT GTT CTG TTT AGA GAT ATT GTT CTT CTT GGG 
CTT CTT GGG TTT CTC TTC TTT CTC TTC TGC TTG AGC TTA 1 04 
TGC TTG AGC TTA 104 Leu Phe Arg Asp He Val Leu Leu Gly Phe Leu 
Leu Phe Arg Asp lie Val Leu Phe Cys Leu Ser Leu 

Leu Gly Phe Leu Phe Cys Leu 5 10 15 

Ser Leu GTA GCT ACT GTG ACT TCA GAG GAG GGA 

5 GCA ACG TTG CTG GAG ATT AAG 1 52 

10 15 
GTA GCT ACT GTG ACT TCA 
GAG GAG GGA GCA ACG 
TTG CTG GAG ATT AAG 
152 

Val Ala Thr Val Thr Ser Glu Glu Val Ala Thr Val Thr Ser Glu Glu Gly Ala Thr Leu 
Gly Ala Thr Leu Leu Glu lie Lys Leu Glu lie Lys 

20 25 20 25 

30 30 

AAG TCA TTC AAA GAT GTG AAG TCA TTC AAA GAT GTG AAC AAT GTT 
AAC AAT GTT CTT TAT GAC CTT TAT GAC TGG ACA ACT TCA 200 
TGG ACA ACT TCA 200 Lys Ser Phe Lys Asp Val Asn Asn Val Leu Tyr 
Lys Ser Phe Lys Asp Val Asn Asp Trp Thr Thr Ser 
Asn Val Leu Tyr Asp Trp Thr 
Thr Ser 

35 40 35 40 

45 50 45 50 

CCT TCT TCG GAT TAT TGT CCT TCT TCG GAT TAT TGT GTC TGG AGA 
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GTCTGG AGAGGTGTGTCT GGT GTG TCTTGT GAAAAT GTC 248 
TGT GAAAAT GTC 248 Pro Ser Ser Asp Tyr Cys Val Trp Arg Gly Val Ser 
Pro Ser Ser Asp Tyr Cys Val Cys Glu Asn Val 

Trp Arg Gly Val Ser Cys Glu 55 60 
Asn Val 65 
55 

60 65 

ACC TTC AAT GTT GTT GCT ACC TTC AAT GTT GTT GCT CTT AAT TTG 
CTT AAT TTG TCA GAT TTG TCA GAT TTG AAT CTT GAT GGA 296 
AAT CTT GAT GGA 296 Thr Phe Asn Val Val Ala Leu Asn Leu Ser Asp 
Thr Phe Asn Val Val Ala Leu Leu Asn Leu Asp Gly 

Asn Leu Ser Asp Leu Asn Leu 70 75 
Asp Gly 80 

70 GAA ATC TCA CCT GCT ATT GGA GAT CTC 

75 80 AAG AGT CTC TTG TCAATT GAT 344 

GAA ATC TCA CCT GCT ATT 
GGA GAT CTC AAG AGT CTC 
TTG TCAATT GAT 344 

Glu lie Ser Pro Ala lie Gly Asp Glu lie Ser Pro Ala He Gly Asp Leu Lys Ser Leu 
Leu Lys Ser Leu Leu Ser lie Leu Ser lie Asp 

Asp 85 90 

85 95 
90 95 CTG CGA GGT AAT CGC TTG TCT GGA CAA 

CTG CGA GGT AAT CGC TTG ATC CCT GAT GAG ATT GGT G AC 392 
TCT GGA CAA ATC CCT GAT Leu Arg Gly Asn Arg Leu Ser Gly Gin He Pro 
GAG ATT GGT GAC 392 Asp Glu He Gly Asp 
Leu Arg Gly Asn Arg Leu Ser 
Gly Gin He Pro Asp Glu lie Gly 
Asp 

100 105 100 105 

110 110 

TGT TCT TCT TTG CAA AAC TGT TCT TCT TTG CAA AAC TTA GAC TTA 
TTA GAC TTA TCC TTC AAT TCC TTC AAT GAA TTA AGT GGT 440 
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GAA TTAAGT GGT 440 Cys Ser Ser Leu Gin Asn Leu Asp Leu Ser Phe 
Cys Ser Ser Leu Gin Asn Leu Asn Glu Leu Ser Gly 

Asp Leu Ser Phe Asn Glu Leu 115 120 
Ser Gly 125 130 

115 120 
125 130 

GAC ATA CCG TTT TCG ATT GAC ATA CCG TTT TCG ATT TCG AAG TTG 
TCG AAG TTG AAG CAA CTT AAG CAA CTT GAG CAG CTG ATT 488 
GAG CAG CTG ATT 488 Asp lie Pro Phe Ser lie Ser Lys Leu Lys Gin Leu 
Asp lie Pro Phe Ser lie Ser Lys Glu Gin Leu lie 

Leu Lys Gin Leu Glu Gin Leu lie 135 140 
135 145 

1 40 1 45 CTG AAG AAT AAC CAA TTG ATA GGA CCG 

CTG AAG AAT AAC CAA TTG ATC CCT TCA ACA CTT TCA CAG 536 
ATA GGA CCG ATC CCT TCA 
ACA CTT TCA CAG 536 

Leu Lys Asn Asn Gin Leu lie Leu Lys Asn Asn Gin Leu lie Gly Pro lie Pro Ser 
Gly Pro lie Pro Ser Thr Leu Ser Thr Leu Ser Gin 

Gin 150 155 

150 160 
155 160 ATT CCA AAC CTG AAA ATT CTG GAC TTG 

ATT CCA AAC CTG AAA ATT GCA CAG AAT AAA CTC AGT GGT 584 
CTG GAC TTG GCA CAG AAT lie Pro Asn Leu Lys lie Leu Asp Leu Ala Gin Asn 
AAA CTC AGT GGT 584 Lys Leu Ser Gly 
lie Pro Asn Leu Lys lie Leu Asp 
Leu Ala Gin Asn Lys Leu Ser 
Gly 

165 165 170 

170 175 175 

GAG ATA CCA AGA CTT ATT GAG ATA CCA AGA CTT ATT TAC TGG AAT 
TAC TGG AAT GAA GTT CTT GAA GTT CTT CAG TAT CTT GGG 632 
CAG TAT CTT GGG 632 Glu lie Pro Arg Leu lie Tyr Trp Asn Glu Val Leu 
Glu lie Pro Arg Leu He Tyr Trp Gin Tyr Leu Gly 
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Asn Glu Val Leu Gin Tyr Leu 180 185 

Gly 190 
180 185 

190 

TTG CGA GGA AAC AAC TTA TTG CGA GGA AAC AAC TTA GTC GGT AAC 

GTC GGT AAC ATT TCT CCA ATT TCT CCA GAT TTG TGT CAA 680 

GAT TTG TGT CAA 680 Leu Arg Gly Asn Asn Leu Val Gly Asn lie Ser 

Leu Arg Gly Asn Asn Leu Val Pro Asp Leu Cys Gin 

Gly Asn He Ser Pro Asp Leu 195 200 

CysGIn 205 210 

195 200 CTG ACT GGT CTT TGG TAT TTT GAC GTA 

205 210 AGA AAC AAC AGT TTG ACT GGT 728 

CTG ACT GGT CTT TGG TAT 

TTT GAC GTA AGA AAC AAC 

AGT TTG ACT GGT 728 

Leu Thr Gly Leu Trp Tyr Phe Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Asn 

Asp Val Arg Asn Asn Ser Leu Asn Ser Leu Thr Gly 

Thr Gly 215 220 

215 225 

220 225 AGT ATA CCT GAG ACG ATA GGA AAT TGC 

AGT ATA CCT GAG ACG ATA ACT GCC TTC CAG GTT TTG GAC 776 

GGA AAT TGC ACT GCC TTC Ser lie Pro Glu Thr lie Gly Asn Cys Thr Ala Phe 

CAG GTT TTG GAC 776 Gin Val Leu Asp 

Ser He Pro Glu Thr lie Gly Asn 

Cys Thr Ala Phe Gin Val Leu 

Asp 

230 230 235 

235 240 240 

TTG TCC TAC AAT CAG CTA TTG TCC TAC AAT CAG CTA ACT GGT GAG 

ACT GGT GAG ATC CCT TTT ATC CCT TTT GAC ATC GGC TTC 824 

GAC ATC GGC TTC 824 Leu Ser Tyr Asn Gin Leu Thr Gly Glu lie Pro 

Leu Ser Tyr Asn Gin Leu Thr Phe Asp lie Gly Phe 

Gly Glu He Pro Phe Asp lie Gly 245 250 
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Phe 255 
245 

250 255 

CTG CAA GTT GCA ACA TTA CTG CAA GTT GCA ACA TTA TCA TTG CAA 

TCA TTG CAA GGC AAT CAA GGC AAT CAA CTC TCT GGG AAG 872 

CTC TCT GGG AAG 872 Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn 

Leu Gin Val Ala Thr Leu Ser Gin Leu Ser Gly Lys 

Leu Gin Gly Asn Gin Leu Ser 260 265 

Gly Lys 270 

260 265 ATT CCA TCA GTG ATT GGT CTC ATG CAA 

270 GCC CTT GCA GTC TTA GAT CTA 920 
ATT CCA TCA GTG ATT GGT 
CTC ATG CAA GCC CTT GCA 
GTC TTA GAT CTA 920 

He Pro Ser Val He Gly Leu Met lie Pro Ser Val lie Gly Leu Met Gin Ala Leu Ala 

Gin Ala Leu Ala Val Leu Asp Val Leu Asp Leu 

Leu 275 280 

275 280 285 290 

285 290 AGT GGC AAC TTG TTG AGT GGA TCT ATT 

AGT GGC AAC TTG TTG AGT CCT CCG ATT CTC GGA AAT CTT 968 

GGA TCT ATT CCT CCG ATT Ser Gly Asn Leu Leu Ser Gly Ser lie Pro Pro lie 

CTC GGA AAT CTT 968 Leu Gly Asn Leu 

Ser Gly Asn Leu Leu Ser Gly 

Ser lie Pro Pro lie Leu Gly Asn 

Leu 

295 295 300 

300 305 305 

ACT TTC ACC GAG AAA TTG ACT TTC ACC GAG AAA TTG TAT TTG CAC 

TAT TTG CAC AGT AAC AAG AGT AAC AAG CTG ACT GGT TCA 1016 

CTG ACT GGT TCA 1016 Thr Phe Thr Glu Lys Leu Tyr Leu His Ser Asn 

Thr Phe Thr Glu Lys Leu Tyr Lys Leu Thr Gly Ser 

Leu His Ser Asn Lys Leu Thr 310 315 

Gly Ser 320 
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ATT CCA CCT GAG CTT GGA ATT CCA CCT GAG CTT GGA AAC ATG TCA 

AAC ATG TCA AAA CTC CAT AAA CTC CAT TAC CTG GAA CTC 1 064 

TAC CTG GAA CTC 1064 lie Pro Pro Glu Leu Gly Asn Met Ser Lys Leu 

lie Pro Pro Glu Leu Gly Asn Met His Tyr Leu Glu Leu 



AAT GAT AAT CAT CTC ACG 
GGT CAT ATA CCA CCA GAG 
CTT GGG AAG CTT 1112 

Asn Asp Asn His Leu Thr Gly Asn Asp Asn His Leu Thr Gly His lie Pro Pro 

His He Pro Pro Glu Leu Gly Lys Glu Leu Gly Lys Leu 

Leu 340 345 

340 345 350 

350 ACT GAC TTG TTT GAT CTG AAT GTG GCC 

ACT GAC TTG TTT GAT CTG AAC AAT GAT CTG GAA GGA CCT 1 1 60 

AAT GTG GCC AAC AAT GAT Thr Asp Leu Phe Asp Leu Asn Val Ala Asn Asn 

CTG GAA GGA CCT 1160 Asp Leu Glu Gly Pro 
Thr Asp Leu Phe Asp Leu Asn 
Val Ala Asn Asn Asp Leu Glu 
Gly Pro 

355 360 355 360 

365 370 365 370 

ATA CCT GAT CAT CTG AGC ATA CCT GAT CAT CTG AGC TCT TGC ACA 

TCT TGC ACA AAT CTA AAC AAT CTA AAC AGC TTA AAT GTT 1 208 

AGC TTA AAT GTT 1208 lie Pro Asp His Leu Ser Ser Cys Thr Asn Leu 

lie Pro Asp His Leu Ser Ser Cys Asn Ser Leu Asn Val 

Thr Asn Leu Asn Ser Leu Asn 375 380 

Val 385 



Ser Lys Leu His Tyr Leu Glu 325 
Leu 335 



330 



325 

330 335 



AAT GAT AAT CAT CTC ACG GGT CAT ATA 
CCA CCA GAG CTT GGG AAG CTT 1112 



375 
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380 385 

CAT GGG AAC AAG TTT AGT CAT GGG AAC AAG TTT AGT GGC ACT ATA 

GGC ACT ATA CCC CGA GCA CCC CGAGCATTT CAAAAG CTA 1256 

TTT CAAAAG CTA 1 256 His Gly Asn Lys Phe Ser Gly Thr He Pro Arg Ala 

His Gly Asn Lys Phe Ser Gly Phe Gin Lys Leu 

Thr lie Pro Arg Ala Phe Gin Lys 390 395 

Leu 400 

390 GAA AGT ATG ACT TAC CTT AAT CTG TCC 

395 400 AGC AAC AAT ATC AAA GGT CCA 1304 

GAA AGT ATG ACT TAC CTT 
AAT CTG TCC AGC AAC AAT 
ATC AAA GGT CCA 1304 

Glu Ser Met Thr Tyr Leu Asn Glu Ser Met Thr Tyr Leu Asn Leu Ser Ser Asn 

Leu Ser Ser Asn Asn lie Lys Gly Asn He Lys Gly Pro 

Pro 405 410 

405 415 

410 415 ATC CCG GTT GAG CTA TCT CGT ATC GGT 

ATC CCG GTT GAG CTA TCT AAC TTA GAT ACA TTG GAT CTT 1 352 

CGT ATC GGT AAC TTA GAT He Pro Val Glu Leu Ser Arg lie Gly Asn Leu Asp 

ACA TTG GAT CTT 1 352 Thr Leu Asp Leu 
He Pro Val Glu Leu Ser Arg lie 
Gly Asn Leu Asp Thr Leu Asp 
Leu 

420 425 420 425 

430 430 

TCC AAC AAC AAG ATA AAT TCC AAC AAC AAG ATA AAT GGA ATC ATT 

GGA ATC ATT CCT TCT TCC CCT TCT TCC CTT GGT GAT TTG 1 400 

CTT GGT GAT TTG 1400 Ser Asn Asn Lys He Asn Gly He He Pro Ser Ser 

Ser Asn Asn Lys He Asn Gly lie Leu Gly Asp Leu 

lie Pro Ser Ser Leu Gly Asp Leu 435 440 

435 440 445 450 

445 450 
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GAG CAT CTT CTC AAG ATG GAG CAT CTT CTC AAG ATG AAC TTG AGT 

AAC TTG AGT AGA AAT CAT AGAAAT CAT ATA ACT GGT GTA 1448 

ATA ACT GGT GTA 1448 Glu His Leu Leu Lys Met Asn Leu Ser Arg Asn 

Glu His Leu Leu Lys Met Asn His lie Thr Gly Val 

Leu Ser Arg Asn His lie Thr Gly 455 460 

Val 465 

455 GTT CCA GGC GAC TTT GGA AAT CTA AGA 

460 465 AGC ATC ATG GAA ATA GAT CTT 1496 

GTT CCA GGC GAC TTT GGA 
AAT CTA AGA AGC ATC ATG 
GAA ATA GAT CTT 1496 

Val Pro Gly Asp Phe Gly Asn Val Pro Gly Asp Phe Gly Asn Leu Arg Ser lie 

Leu Arg Ser lie Met Glu lie Asp Met Glu lie Asp Leu 

Leu 470 475 

470 480 

475 480 TCA AAT AAT GAT ATC TCT GGC CCA ATT 

TCA AAT AAT GAT ATC TCT CCA GAA GAG CTT AAC CAA TTA 1 544 

GGC CCA ATT CCA GAA GAG Ser Asn Asn Asp lie Ser Gly Pro lie Pro Glu Glu 

CTT AAC CAA TTA 1 544 Leu Asn Gin Leu 
Ser Asn Asn Asp lie Ser Gly 
Pro lie Pro Glu Glu Leu Asn Gin 
Leu 

485 485 490 

490 495 495 

CAG AAC ATA ATT TTG CTG CAG AAC ATA ATT TTG CTG AGA CTG GAA 

AGA CTG GAA AAT AAT AAC AAT AAT AAC CTG ACT GGT AAT 1 592 

CTG ACT GGT AAT 1592 Gin Asn lie lie Leu Leu Arg Leu Glu Asn Asn 

Gin Asn He lie Leu Leu Arg Leu Asn Leu Thr Gly Asn 

Glu Asn Asn Asn Leu Thr Gly 500 505 

Asn 510 
500 505 

510 

GTT GGT TCA TTA GCC AAC GTT GGT TCA TTA GCC AAC TGT CTC AGT 
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TGT CTC AGT CTC ACT GTA CTC ACT GTA TTG AAT GTA TCT 1 640 
TTG AAT GTA TCT 1640 Val Gly Ser Leu Ala Asn Cys Leu Ser Leu Thr 
Val Gly Ser Leu Ala Asn Cys Val Leu Asn Val Ser 

Leu Ser Leu Thr Val Leu Asn 515 520 

Val Ser 525 530 

515 520 CAT AAC AAC CTC GTA GGT GAT ATC CCT 

525 530 AAG AAC AAT AAC TTC TCA AGA 1688 

CAT AAC AAC CTC GTA GGT 

GAT ATC CCT AAG AAC AAT 

AAC TTC TCA AGA 1 688 

His Asn Asn Leu Val Gly Asp lie His Asn Asn Leu Val Gly Asp lie Pro Lys Asn 
Pro Lys Asn Asn Asn Phe Ser Asn Asn Phe Ser Arg 

Arg 535 540 

535 545 

540 545 TTT TCA CCA GAC AGC TTC ATT GGC AAT 

TTT TCA CCA GAC AGC TTC CCT GGT CTT TGC GGT AGT TGG 1 736 
ATT GGC AAT CCT GGT CTT Phe Ser Pro Asp Ser Phe lie Gly Asn Pro Gly 
TGC GGT AGT TGG 1736 Leu Cys Gly Ser Trp 
Phe Ser Pro Asp Ser Phe He 
Gly Asn Pro Gly Leu Cys Gly 
Ser Trp 

550 550 555 

555 560 560 

CTA AAC TCA CCG TGT CAT CTA AAC TCA CCG TGT CAT GAT TCT CGT 
GAT TCT CGT CGA ACT GTA CGAACT GTA CGA GTG TCA ATC 1784 
CGA GTG TCA ATC 1784 Leu Asn Ser Pro Cys His Asp Ser Arg Arg Thr 
Leu Asn Ser Pro Cys His Asp Val Arg Val Ser He 

Ser Arg Arg Thr Val Arg Val Ser 565 570 
lie 575 
565 

570 575 

TCT AGA GCA GCT ATT CTT TCT AGA GCA GCT ATT CTT GGA ATA GCT 
GGA ATA GCT ATT GGG GGA ATT GGG GGA CTT GTG ATC CTT 1 832 
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CTT GTG ATC CTT 1832 Ser Arg Ala Ala lie Leu Gly lie Ala He Gly Gly 
Ser Arg Ala Ala He Leu Gly He Leu Val lie Leu 

Ala lie Gly Gly Leu Val He Leu 580 585 

580 585 590 

590 CTC ATG GTC TTA ATA GCA GCT TGC CGA 

CTC ATG GTC TTA ATA GCA CCG CAT AAT CCT CCT CCT TTT 1 880 
GCT TGC CGA CCG CAT AAT 
CCT CCT CCT TTT 1880 

Leu Met Val Leu He Ala Ala Cys Leu Met Val Leu He Ala Ala Cys Arg Pro His Asn 
Arg Pro His Asn Pro Pro Pro Pro Pro Pro Phe 

Phe 595 600 

595 600 605 610 

605 61 0 CTT GAT GGA TCA CTT GAC AAA CCA GTA 

CTT GAT GGA TCA CTT GAC ACT TAT TCG ACA CCG AAG CTC 1 928 

AAA CCA GTA ACT TAT TCG Leu Asp Gly Ser Leu Asp Lys Pro Val Thr Tyr 

ACA CCG AAG CTC 1 928 Ser Thr Pro Lys Leu 

Leu Asp Gly Ser Leu Asp Lys 

Pro Val Thr Tyr Ser Thr Pro Lys 

Leu 

615 615 620 

620 625 625 

GTC ATC CTT CAT ATG AAC GTC ATC CTT CAT ATG AAC ATG GCA CTC 
ATG GCA CTC CAC GTT TAC CAC GTT TAC GAG GAT ATC ATG 1 976 
GAG GAT ATC ATG 1 976 Val I le Leu His Met Asn Met Ala Leu His Val Tyr 
Val lie Leu His Met Asn Met Ala Glu Asp lie Met 

Leu His Val Tyr Glu Asp lie Met 630 635 

630 640 
635 640 

AGA ATG ACA GAG AAT CTA AGA ATG ACA GAG AAT CTA AGT GAG AAG 
AGT GAG AAG TAT ATC ATT TAT ATC ATT GGG CAC GGA GCA 2024 
GGG CAC GGA GCA Arg Met Thr Glu Asn Leu Ser Glu Lys Tyr He lie 
2024 Gly His Gly Ala 

Arg Met Thr Glu Asn Leu Ser 645 650 
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Glu Lys Tyr He He Gly His Gly 655 

Ala TCA AGC ACT GTA TAC AAA TGT GTT TTG 

645 AAG AAT TGT AAA CCG GTT GCG 2072 
650 655 
TCA AGC ACT GTA TAC AAA 
TGT GTT TTG AAG AAT TGT 
AAA CCG GTT GCG 2072 

Ser Ser Thr Val Tyr Lys Cys Val Ser Ser Thr Val Tyr Lys Cys Val Leu Lys Asn 

Leu Lys Asn Cys Lys Pro Val Cys Lys Pro Val Ala 

Ala 660 665 

660 665 670 

670 ATT AAG CGG CTT TAC TCT CAC AAC CCA 

ATT AAG CGG CTT TAC TCT CAG TCA ATG AAA CAG TTT GAA 21 20 

CAC AAC CCA CAG TCA ATG He Lys Arg Leu Tyr Ser His Asn Pro Gin Ser Met 

AAA CAG TTT GAA 21 20 Lys Gin Phe Glu 
lie Lys Arg Leu Tyr Ser His Asn 
Pro Gin Ser Met Lys Gin Phe 
Glu 

675 680 675 680 

685 690 685 690 

ACA GAA CTC GAG ATG CTA ACA GAA CTC GAG ATG CTA AGT AGC ATC 

AGT AGC ATC AAG CAC AGA AAG CAC AGAAAT CTT GTG AGC 2168 

AAT CTT GTG AGC 2168 Thr Glu Leu Glu Met Leu Ser Ser He Lys His Arg 

Thr Glu Leu Glu Met Leu Ser Asn Leu Val Ser 

Ser He Lys His Arg Asn Leu Val 695 700 

Ser 705 
695 

700 705 

CTA CAA GCT TAT TCC CTC CTA CAA GCT TAT TCC CTC TCT CAC TTG 

TCT CAC TTG GGG AGT CTT GGG AGT CTT CTG TTC TAT GAC 221 6 

CTG TTC TAT GAC 2216 Leu Gin Ala Tyr Ser Leu Ser His Leu Gly Ser 

Leu Gin Ala Tyr Ser Leu Ser His Leu Leu Phe Tyr Asp 

Leu Gly Ser Leu Leu Phe Tyr 710 715 
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Asp 720 

710 TAT TTG GAA AAT GGT AGC CTC TGG GAT 

715 720 CTT CTT CAT GGC CCT ACG AAG 2264 

TAT TTG GAA AAT GGT AGC 
CTC TGG GAT CTT CTT CAT 
GGC CCT ACG AAG 2264 

Tyr Leu Glu Asn Gly Ser Leu Tyr Leu Glu Asn Gly Ser Leu Trp Asp Leu Leu 
Trp Asp Leu Leu His Gly Pro His Gly Pro Thr Lys 

Thr Lys 725 730 

725 735 
730 735 AAA AAG ACT CTT GAT TGG GAC ACA CGG 

AAA AAG ACT CTT GAT TGG CTT AAG ATA GCA TAT GGT GCA 2312 
GAC ACA CGG CTT AAG ATA Lys Lys Thr Leu Asp Trp Asp Thr Arg Leu Lys lie 
GCA TAT GGT GCA 231 2 Ala Tyr Gly Ala 
Lys Lys Thr Leu Asp Trp Asp 
Thr Arg Leu Lys lie Ala Tyr Gly 
Ala 



740 745 740 745 

750 750 

GCA CAA GGT TTA GCT TAT GCA CAA GGT TTA GCT TAT CTA CAC CAT 
CTA CAC CAT GAC TGT AGT GAC TGT AGT CCA AGG ATC ATT 2360 
CCA AGG ATC ATT 2360 Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys 
Ala Gin Gly Leu Ala Tyr Leu His Ser Pro Arg He lie 

His Asp Cys Ser Pro Arg lie lie 755 760 
755 760 765 770 

765 770 



CAC AGA GAC GTG AAG TCG CAC AGA GAC GTG AAG TCG TCC AAC ATT 

TCC AAC ATT CTC TTG GAC CTC TTG GAC AAA GAC TTA GAG 2408 

AAA GAC TTA GAG 2408 His Arg Asp Val Lys Ser Ser Asn lie Leu Leu 

His Arg Asp Val Lys Ser Ser Asp Lys Asp Leu Glu 

Asn lie Leu Leu Asp Lys Asp 775 780 

Leu Glu 785 

775 GCT CGT TTG ACA GAT TTT GGA ATA GCG 
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780 785 AAAAGC TTG TGT GTG TCAAAG 2456 

GCT CGT TTG ACA GAT TTT 
GGA ATA GCG AAA AGC TTG 
TGT GTG TCAAAG 2456 



Ala Arg Leu Thr Asp Phe Gly lie Ala Arg Leu Thr Asp Phe Gly lie Ala Lys Ser Leu 

Ala Lys Ser Leu Cys Val Ser Cys Val Ser Lys 

Lys 790 795 

790 800 

795 800 TCA CAT ACT TCA ACT TAC GTG ATG GGC 

TCA CAT ACT TCA ACT TAC ACG ATA GGT TAC ATA GAC CCC 2504 

GTG ATG GGC ACG ATA GGT Ser His Thr Ser Thr Tyr Val Met Gly Thr He Gly 

TAC ATA GAC CCC 2504 Tyr lie Asp Pro 
Ser His Thr Ser Thr Tyr Val Met 
Gly Thr He Gly Tyr lie Asp Pro 



805 805 810 

810 815 815 

GAG TAT GCT CGC ACT TCA GAG TAT GCT CGC ACT TCA CGG CTC ACT 

CGG CTC ACT GAG AAA TCC GAG AAA TCC GAT GTC TAC AGT 2552 

GAT GTC TAC AGT 2552 Glu Tyr Ala Arg Thr Ser Arg Leu Thr Glu Lys Ser 

Glu Tyr Ala Arg Thr Ser Arg Leu Asp Val Tyr Ser 

Thr Glu Lys Ser Asp Val Tyr Ser 820 825 

820 825 830 

830 



TAT GGA ATA GTC CTT CTT TAT GGA ATA GTC CTT CTT GAG CTG TTA 

GAG CTGTTAACC CGAAGG ACC CGAAGG AAA GCC GTT GAT 2600 

AAA GCC GTT GAT 2600 Tyr Gly lie Val Leu Leu Glu Leu Leu Thr Arg Arg 

Tyr Gly He Val Leu Leu Glu Leu Lys Ala Val Asp 

Leu Thr Arg Arg Lys Ala Val Asp 835 840 

835 840 845 850 

845 850 GAC GAA TCC AAT CTC CAC CAT CTG ATA 

GAC GAA TCC AAT CTC CAC ATG TCAAAG ACG GGGAAC AAT 2648 

CAT CTG ATA ATG TCA AAG 

ACG GGG AAC AAT 2648 
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Asp Glu Ser Asn Leu His His 
Leu lie Met Ser Lys Thr Gly Asn 
Asn 

855 

860 865 
GAA GTG ATG GAA ATG GCA 
GAT CCA GAC ATC ACA TCG 
ACG TGT AAA GAT 2696 
Glu Val Met Glu Met Ala Asp 
Pro Asp lie Thr Ser Thr Cys Lys 
Asp 

870 

875 880 
CTC GGT GTG GTG AAG AAA 
GTT TTC CAA CTG GCA CTC 
CTA TGC ACC AAA 2744 
Leu Gly Val Val Lys Lys Val Phe 
Gin Leu Ala Leu Leu Cys Thr 
Lys 

885 

890 895 

AGA CAG CCG AAT GAT CGA 
CCC ACA ATG CAC CAG GTG 
ACT CGT GTT CTC 2792 
Arg Gin Pro Asn Asp Arg Pro 
Thr Met His Gin Val Thr Arg Val 
Leu 

900 905 

910 

GGC AGT TTT ATG CTA TCG 
GAA CAA CCA CCT GCT GCG 
ACT GAC ACG TCA 2840 



Asp Glu Ser Asn Leu His His Leu lie Met Ser 
Lys Thr Gly Asn Asn 

855 860 
865 

GAA GTG ATG GAA ATG GCA GAT CCA GAC 
ATC ACA TCG ACG TGT AAA GAT 2696 
Glu Val Met Glu Met Ala Asp Pro Asp lie Thr Ser 
Thr Cys Lys Asp 



870 875 
880 

CTC GGT GTG GTG AAG AAA GTT TTC CAA 
CTG GCA CTC CTA TGC ACC AAA 2744 
Leu Gly Val Val Lys Lys Val Phe Gin Leu Ala 
Leu Leu Cys Thr Lys 

885 890 
895 



AGA CAG CCG AAT GAT CGA CCC ACA ATG 
CAC CAG GTG ACT CGT GTT CTC 2792 
Arg Gin Pro Asn Asp Arg Pro Thr Met His Gin 
Val Thr Arg Val Leu 

900 905 
910 

GGC AGT TTT ATG CTA TCG GAA CAA CCA 
CCT GCT GCG ACT GAC ACG TCA 2840 
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Gly Ser Phe Met Leu Ser Glu Gly Ser Phe Met Leu Ser Glu Gin Pro Pro Ala 
Gin Pro Pro Ala Ala Thr Asp Thr Ala Thr Asp Thr Ser 

Ser 915 920 

915 920 925 930 

925 930 GCG ACG CTG GCT GGT TCG TGC TAC GTC 

GCG ACG CTG GCT GGT GAT GAG TAT GCAAAT CTC AAG 2888 

TCG TGC TAC GTC GAT GAG Ala Thr Leu Ala Gly Ser Cys Tyr Val Asp Glu Tyr 

TAT GCA AAT CTC AAG Ala Asn Leu Lys 

2888 

Ala Thr Leu Ala Gly Ser Cys Tyr 
Val Asp Glu Tyr Ala Asn Leu 
Lys 

935 935 940 

940 945 945 

ACT CCT CAT TCT GTC AAT ACT CCT CAT TCT GTC AAT TGC TCT TCC 
TGC TCT TCC ATG AGT GCT ATG AGT GCT TCT GAT GCT CAA 2936 
TCT GAT GCT CAA 2936 Thr Pro His Ser Val Asn Cys Ser Ser Met Ser 
Thr Pro His Ser Val Asn Cys Ala Ser Asp Ala Gin 

Ser Ser Met Ser Ala Ser Asp 950 955 
Ala Gin 960 
950 

955 960 



CTG TTT CTT CGG TTT GGA CTG TTT CTT CGG TTT GGA CAA GTT ATT 
CAA GTT ATT TCT CAG AAC TCT CAG AAC AGT GAG 2978 
AGT GAG 2978 Leu Phe Leu Arg Phe Gly Gin Val lie Ser Gin 

Leu Phe Leu Arg Phe Gly Gin Asn Ser Glu 

Val He Ser Gin Asn Ser Glu 965 970 

965 975 
970 975 TAGTTTTTCG TTAGGAGGAG AATCTTTAAA 

TAGTTTTTCG TTAGGAGGAG ACGGTATCTT TTCGTTGCGT TAAGCTGTTA 
AATCTTTAAA ACGGTATCTT 3038 
TTCGTTGCGT TAAGCTGTTA 
3038 
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GAAAAATTAA TGTCTCATGT GAAAAATTAA TGTCTCATGT AAAGTATTAT 
AAAGTATTAT GCACTGCCTT GCACTGCCTT ATTATTATTA GACAAGTGTG 
ATTATTATTA GACAAGTGTG 3098 

3098 TGGTGTGAAT ATGTCTTCAG ACTGGCACTT 

TGGTGTGAAT ATGTCTTCAG AGACTTCCAA AAAAAAAAAA AAAAAAAAAA 
ACTGGCACTT AGACTTCCAA 3158 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 
3158 3176 
AAAAAAAAAA AAAAAAAA 
3176 



[0 0 5 7] [0057] 

IE?iJ#-§- : 2 Sequence number: 2 

IB?iJcoft£ : 9295 Sequence length: 9295 

la^iJcDlti : WM. Sequence type: Nucleic acid 

: ~ >MH The number of strands: It is double stranded. 



Mi^lfDUW. : genomic DNA 



(Arabidopsis thaliana) 



Topology: Linear 

Type of sequence: genomic DNA 

Origin 

Organism-name: Arabidopsis 
(Arabidopsis thaliana) 



thaliana 



: any fcfT Strain name: Colombia 

IE?iJ(£>#lifc : Sequence characteristics : 

Wffli&^'tt^ : exon Symbol showing the characteristics: exon 

: 1803.. 1881 Location: 1803.1881 



Ifl^'JWNHifc : Sequence characteristics : 

1#tit^r^i"IB-^ : intron Symbol showing the characteristics: intron 

: 1882..2227 Location: 1882.2227 

lfi£iJcD#|iS[ : Sequence characteristics : 



#®£^1~f£-^- : exon Symbol showing the characteristics: exon 

: 2228..2366 Location: 2228.2366 

@fi^ij£0#® : Sequence characteristics : 
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«£irfffi-^: intron 

#SfI : 2367..2467 

|fl?IJ©#m : 
^n^-tm^ : intron 
: 2540..2643 

Ifi^lJW#m : 
¥fWL*mi-m^ : exon 
: 2468..2539 

mn<o®m ■ 

«r*&*1-|B*: exon 
: 2644..2715 

: 

W^m-tm^ : intron 

: 2716..2809 

i|#m^^-rf2f- : exon 
ftt±.\iLW. : 2810..2878 

W^m-rm^: intron 
: 2879..2968 

: exon 
: 2969.. 3040 

#m£^-ffE-5§- : intron 

: 3041. .3118 

: exon 

fttefcW. : 3119..3190 



Symbol showing the characteristics: intron 

Location: 2367.2467 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 2540.2643 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 2468.2539 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 2644.2715 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: . 2716.2809 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 2810.2878 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 2879.2968 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 2969.3040 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 3041.3118 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 3119.3190 
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#m^*1-f2-^ : intron 
: 31 91. .3266 

: 

ftmzm-ftm- : exon 
: 3267..3338 

<ft» : intron 

: 3339.. 3421 

^Kimfl-^: exon 
fffe&S : 3422..3493 

Wm^m-rm^-: intron 
: 3494..3586 

^WLZm-tB^ : exon 
: 3587..3655 

mmwfk : 

tftrnzm-tm^: intron 

#£{iffi : 3656..3740 

#m^^-rtfi-^: exon 
: 3741. .381 2 

m^m-rm^r : intron 
^{ig: 381 3..3888 



Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 3191.3266 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 3267.3338 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 3339.3421 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 3422.3493 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 3494.3586 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 3587.3655 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 3656.3740 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 3741.3812 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 3813.3888 

Sequence characteristics : 
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exon 

ft&l&W. : 3889..3960 

W»**"ME*: intron 

#£E&g : 3961.. 4048 

«r*«r*"t1B* : exon 
#£{£g : 4049.. 41 20 

Efli<&4*» : 

mk&&-tU*: intron 
=f¥£&g : 41 21. .4209 

W&Z^tfm^ : exon 
#£{ig : 4210.. 4281 
@cWl#® : 
ftmzm-tm^ : intron 

#ffifug : 4282.. 4349 

eai©«MK : 

exon 

#&{ig : 4350.. 4421 

la^ijow : 

<KMR«r**1E* : intron 
£F£M : 4422.. 4508 

mm<D&u ■ 

¥f'$.*m-tm^ : exon 
ft&tiLW ■ 4509.. 4580 

W&Zm-tm^r : intron 

#£<£g: 4581.. 4706 



Symbol showing the characteristics: exon 

Location: 3889.3960 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 3961.4048 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 4049.4120 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 4121.4209 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 4210.4281 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 4282.4349 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 4350.4421 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 4422.4508 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 4509.4580 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 4581.4706 
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^mzm-rm^: exon 
Ml: 4707..4778 

■. 

#m^^1-fB-^ : intron 
ftHitLW : 4779.. 4860 

#m£3H-?e-^: exon 
: 4861.. 4932 

#m^^-rfe-^: intron 

i?»g : 4933..5018 

mmm-tm^r: exon 
: 5019..5090 

W^^-rte-^: intron 
5091. .51 76 

mwwm ■. 

m^Zm-fB^r : exon 
: 5177..5248 
IB^iJW#lt : 
mmmi-m^: intron 

: 5249.. 541 2 

K£|J<D#® : 
#m^^-rie-^ : exon 
#£#B : 541 3.. 5481 

ifi^J<D#m : 
Wmzm-tm^ : intron 



Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 4707.4778 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 4779.4860 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 4861.4932 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 4933.5018 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 5019.5090 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 5091.5176 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 5177.5248 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 5249.5412 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 5413.5481 

Sequence characteristics : 

Symbol showing the characteristics: intron 
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5482..5576 

mw&® : 
ftrnzm-tm^ ■. exon 

#»g : 5577.. 5648 
mmzm-tm^: intron 
ft&iiLW- : 5649..5726 

la^ijco^m : 

«£^m^: exon 
fttEitLW : 5727.. 5800 

ftmzm-fm^ : intron 
#»g: 5801.. 5882 
ia?IJ<7)#® : 

W^^-Tfe-^- : exon 
: 5883.. 6011 

sa^jo^m : 

Wm^mirm^: exon 

»f]ig : 6096.. 6443 

#m^1-fS^: intron 
: 601 2.. 6095 

W^^i-fS-^- : intron 
: 6444.. 65 19 

mwwm ■ 

Wm&1r^: exon 
»M : 6520..6890 

mmw® : 



Location: 5482.5576 
Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 5577.5648 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 5649.5726 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 5727.5800 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 5801.5882 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 5883.6011 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 6096.6443 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 6012.6095 

Sequence characteristics : 

Symbol showing the characteristics: intron 

Location: 6444.6519 

Sequence characteristics : 

Symbol showing the characteristics: exon 
Location: 6520.6890 
Sequence characteristics : 
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ff&itiW. : 6891. .6974 



#a**-re# = exon 

: 6975.. 7328 



Location: 6891 .6974 

Sequence characteristics : 

Symbol showing the characteristics: exon 

Location: 6975.7328 



mm 



Sequence 



GAATTCAAAG GAATAAGCAT GAATTCAAAG GAATAAGCAT CGGAGACGAT 
CGGAGACGAT TTAATGTTAC TTAATGTTAC CTCTTGACGT ATTTATCCAA 
CTCTTGACGT ATTTATCCAA 60 



TTTATCCATT AAGCCACCAG TGATCATCAT CATCAACATA TAAATAACCA 
CCATAGCATC TGATCATCAT 120 

CATCAACATA TAAATAACCA AATTTGAAAT GAACAAAAGT CGAATTGGTG 



AATTTGAAAT GAACAAAAGT 180 
CGAATTGGTG ATATTGAAAA 
TCGAGTTCGT GAAATTGAGA 
180 

ATCGGATTGG TGAATTTGAA ATCGGATTGG TGAATTTGAA GAGAGATGCG 

GAGAGATGCG TGTACCGTTA TGTACCGTTA GGGAGGAGGA 

GGGAGGAGGA GGAGACGGGA 240 

GGAGACGGGA 240 GAGAAAAAAG GAGACGGAGA 

GAGAAAAAAG TAACTCGCCG GCTCTGTTTC 

GAGACGGAGA CATGGCGGAG GTGATAATGT 300 

TAACTCGCCG GCTCTGTTTC AGCTGCGCAC GTTAGCTTTT TGTGGTTTGA 

CATGGCGGAG GTGATAATGT GTTGGAGAAC AGTGGGAGGC 

300 TCACGGTAGC 360 

AGCTGCGCAC GTTAGCTTTT GTGGAGTGAC GACATTGGGG 

TGTGGTTTGA ATAACACCAG AGGCGTCTTA TCTCCGTTGG 

GTTGGAGAAC ACAAATTATT 420 

AGTGGGAGGC 

TCACGGTAGC 360 

GTGGAGTGAC 



60 



TTTATCCATT AAGCCACCAG CCATAGCATC 



120 



ATATTGAAAA TCGAGTTCGT GAAATTGAGA 
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GACATTGGGG ATAACACCAG 
AGGCGTCTTA TCTCCGTTGG 
ACAAATTATT 420 

ATTATGGCTA TGAACATTCA ATTATGGCTA TGAACATTCA ACATATAATT 
ACATATAATT TAATTAGAAA TAATTAGAAA TTTGCGGATG AAAAAGAGGT 
TTTGCGGATG AAAAAGAGGT 480 

480 AAACAATTGC AGAAATGGTT AAAAATATTA 

AAACAATTGC AGAAATGGTT ACGTTGTACA GCAAATGATA ATAAAAAGTG 
AAAAATATTA ACGTTGTACA 540 

GCAAATGATA ATAAAAAGTG TAACGTACAG TGTGTAAGGA ATGGAAAAAT 
540 AATAATTTGG GTTAAAATAA ATATGTAGTT 

TAACGTACAG TGTGTAAGGA 600 

ATGGAAAAAT AATAATTTGG TTCTAACTAT ATAGTACTTT TTGAGAAAAG 

GTTAAAATAA ATATGTAGTT ATAATATTAT GTGTATTTTT ATTGAAACAA 

600 660 

TTCTAACTAT ATAGTACTTT 

TTGAGAAAAG ATAATATTAT 

GTGTATTTTT ATTGAAACAA 

660 

ATAAATGATT TAACAAAAAA ATAAATGATT TAACAAAAAA AAAAAGAGAA 
AAAAAGAGAA GTTAAAATGA GTTAAAATGA AAAGGAATTA TTATTTTTTA 
AAAGGAATTA TTATTTTTTA 720 

720 AGTTCTTCCT TCTTTTGTTG GGCCTGTGAC 

AGTTCTTCCT TCTTTTGTTG CCTTTTAGTT TTAGTCCACT TCGTTCTCAA 
GGCCTGTGAC CCTTTTAGTT 780 

TTAGTCCACT TCGTTCTCAA AGCTTCAAAA TATTAATTTT GTGACAAACC 
780 GACCGGAGCC AACCAAACCG 

AGCTTCAAAA TATTAATTTT GTTAACATCC 840 

GTGACAAACC TAAAACCAAT CATATTTTAT TAAGTTTTGT 

GACCGGAGCC GTTGATGCTA AACCAAAAAT CATTGGCATG 

AACCAAACCG GTTAACATCC 900 
840 

TAAAACCAAT CATATTTTAT 
TAAGTTTTGT GTTGATGCTA 
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AACCAAAAAT CATTGGCATG 
900 

CATATTTCTA AATTTAGTAA CATATTTCTA AATTTAGTAA TAAACAAAAA 
TAAACAAAAA CACTTAGAAA CACTTAGAAA TCACACGTTC ACTATACTAA 
TCACACGTTC ACTATACTAA 960 

960 AAAACGTTGA CAAAAACACA ACAACTATAC 

AAAACGTTGA CAAAAACACA TAATAATTAA AGAAGAGAAA ACTGAACCAA 
ACAACTATAC TAATAATTAA 1020 

AGAAGAGAAA ACTGAACCAA ACTTTTTGTA AACTCCTGAA TTTAAATTAG 
1020 TAATTGAAGT AAGAAGATGA AGAAGAACAT 

ACTTTTTGTA AACTCCTGAA 1080 

TTTAAATTAG TAATTGAAGT GTTAAGCAAA CAAAAAAATT ACACTAAAAT 

AAGAAGATGA AGAAGAACAT CATATAAAAA TACATAATTA CAAAAGTACC 

1080 1140 

GTTAAGCAAA CAAAAAAATT 

ACACTAAAAT CATATAAAAA 

TACATAATTA CAAAAGTACC 

1140 

CATAAGATGG ATTTATTGAT CATAAGATGG ATTTATTGAT ATGGGTCATC 
ATGGGTCATC TGTGAAACAA TGTGAAACAA GCCACAGAGA 
GCCACAGAGA GACAAAGACT 1200 

GACAAAGACT 1200 CGTAAGTATT GGGCAACGAA 

CGTAAGTATT GGGCAACGAA AGCGACCTCC TTTATTCACC ACTGCCATTA 
AGCGACCTCC TTTATTCACC ACATGTTCTT 1260 

ACTGCCATTA ACATGTTCTT CTTCTCCTTC TTCTTCTACA TTTTATGACC 
1260 GTTTTACCCT TCAAGAGAGA GAAACAAAAT 

CTTCTCCTTC TTCTTCTACA 1320 

TTTTATGACC GTTTTACCCT CACTCCCTCT CACTCACTCT ATCTCTCTCT 

TCAAGAGAGA GAAACAAAAT TCTGCAAAGC TTCAGAACTC 

1320 TGGCAGAGAG 1380 

CACTCCCTCT CACTCACTCT 

ATCTCTCTCT TCTGCAAAGC 

TTCAGAACTC 

TGGCAGAGAG 1380 
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ATAAAAGATG ATGGGGTTTT ATAAAAGATG ATGGGGTTTT TAACTTTATC 
TAACTTTATC CTCCCCAAAT CTCCCCAAAT AATTCTTCTT CCCTTCATCT 
AATTCTTCTT CCCTTCATCT 1440 

1440 CTCTCTCTTA CACAACAGGT CCCTACATTT 

CTCTCTCTTA CACAACAGGT GTACAATCTC CTCTCTTTAA AGACTCTCTC 
CCCTACATTT GTACAATCTC 1500 

CTCTCTTTAA AGACTCTCTC TCTTTCTCTC TCCATCTCTA TCTTACTCTG 
1500 TATTTCTGTC GTCTGAGCAC TCAATGAAAC 

TCTTTCTCTC TCCATCTCTA 1560 

TCTTACTCTG TATTTCTGTC CACTGTAAAT TTCCGCCAGA ATTTGATGTG 
GTCTGAGCAC TCAATGAAAC ATGGAACGAT AAAAATCATT TTTTCTCGGT 
1560 1620 
CACTGTAAAT TTCCGCCAGA 
ATTTGATGTG ATGGAACGAT 
AAAAATCATT TTTTCTCGGT 
1620 

TAAAGTAAAA AAACAAAAAC 
AAATTTCTGT AGAAATCATA 
ATAAAAGAAA GAAAAAAAAT 
1680 

CTAATGTCGG TACATAATAC 
GGTTCTCTTC TTCTTCTCTA 
TCCTCTGTTT CTTCTTCATG 
1740 

GAGACTTGAA AGCTTTTAAA 
GTATATCTAA AAACGCAGTC 
GTTTTAAGAC TGTGTGTGAG 
1800 

AAATGGCTCT GTTTAGAGAT 
ATTGTTCTTC TTGGGTTTCT 
CTTCTGCTTG AGCTTAGTAG 
1860 

CTACTGTGAC CTACTGTGAC TTCAGAGGAG GGTCAGTTAT 



TAAAGTAAAA AAACAAAAAC AAATTTCTGT 
AGAAATCATA ATAAAAGAAA GAAAAAAAAT 
1680 

CTAATGTCGG TACATAATAC GGTTCTCTTC 
TTCTTCTCTA TCCTCTGTTT CTTCTTCATG 
1740 

GAGACTTGAA AGCTTTTAAA GTATATCTAA 
AAACGCAGTC GTTTTAAGAC 
TGTGTGTGAG 1800 

AAATGGCTCT GTTTAGAGAT ATTGTTCTTC 
TTGGGTTTCT CTTCTGCTTG AGCTTAGTAG 
1860 
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TTCAGAGGAG GGTCAGTTAT TATACTGATG CATGCTTCTT CAAGTTCAAG 

TATACTGATG CATGCTTCTT 1920 

CAAGTTCAAG 1920 ATTTTCGTCT TTTTGTTTTA TATTAGTGAA 

ATTTTCGTCT TTTTGTTTTA AAAAACTTAA AGATGAGATT TTTATATGAT 

TATTAGTGAA AAAAACTTAA 1980 

AGATGAGATT TTTATATGAT TTTTGAAGTT TCATTTGGTG AAAATGAGAT 

1980 CTGGGTACTT GTTATTTTCT ATTTTTGCTT 

TTTTGAAGTT TCATTTGGTG 2040 

AAAATGAGAT CTGGGTACTT TTTGTAATGG TTTTTTTTTA CTTGGTGGGT 

GTTATTTTCT ATTTTTGCTT CTTCTATAGA ATCAAAAGAA GCTTTGAATA 

2040 2100 

TTTGTAATGG TTTTTTTTTA 

CTTGGTGGGT CTTCTATAGA 

ATCAAAAGAA GCTTTGAATA 

2100 

AATTAGGGTT TGAGTTTTAT AATTAGGGTT TGAGTTTTAT TTTGTTTTCT 

TTTGTTTTCT TGGAAGTTGA TGGAAGTTGA ATTTTTAATC TTCTCAAGAA 

ATTTTTAATC TTCTCAAGAA 2160 

2160 CTGACAAATA TTTTTTTTTG TTTTTGTGCG 

CTGACAAATA TTTTTTTTTG TGTGTGTTAA TAAAATATCC TTAAAACAAA 

TTTTTGTGCG TGTGTGTTAA 2220 

TAAAATATCC TTAAAACAAA ATTAAAGGAG CAACGTTGCT GGAGATTAAG 

2220 AAGTCATTCA AAGATGTGAA CAATGTTCTT 

ATTAAAGGAG CAACGTTGCT 2280 

GGAGATTAAG AAGTCATTCA TATGACTGGA CAACTTCACC TTCTTCGGAT 

AAGATGTGAA CAATGTTCTT TATTGTGTCT GGAGAGGTGT GTCTTGTGAA 

2280 2340 

TATGACTGGA CAACTTCACC 

TTCTTCGGAT TATTGTGTCT 

GGAGAGGTGT 

GTCTTGTGAA 2340 

AATGTCACCT TCAATGTTGT AATGTCACCT TCAATGTTGT TGCTCTGTAA 

TGCTCTGTAA GTTTCTTCAT GTTTCTTCAT TCCTTTAGAT TACTATTACA 

TCCTTTAGAT TACTATTACA 2400 
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2400 GTGGTTTTTG GTGTTCTTGT GGGAAAAAGT 

GTGGTTTTTG GTGTTCTTGT TGTAATTTGT TTTGTGTGTG TTTTCTATGT 
GGGAAAAAGT TGTAATTTGT 2460 

TTTGTGTGTG TTTTCTATGT TTTGTAGTAA TTTGTCAGAT TTGAATCTTG 
2460 ATGGAGAAAT CTCACCTGCT ATTGGAGATC 

TTTGTAGTAA TTTGTCAGAT 2520 

TTGAATCTTG ATGGAGAAAT TCAAGAGTCT CTTGTCAATG TAACTGTTTC 

CTCACCTGCT ATTGGAGATC AACATTCACT GTAGCATGAA ATAAAGTATC 

2520 2580 

TCAAGAGTCT CTTGTCAATG 

TAACTGTTTC AACATTCACT 

GTAGCATGAA ATAAAGTATC 

2580 

TTACTTTAAT TCTATTCCAC TTACTTTAAT TCTATTCCAC TCTCTGAGTT 
TCTCTGAGTT GTGACTTTTG GTGACTTTTG TCTTCTGTTT TTTTCTAATG 
TCTTCTGTTT TTTTCTAATG 2640 

2640 TAGTGATCTG CGAGGTAATC GCTTGTCTGG 

TAGTGATCTG CGAGGTAATC ACAAATCCCT GATGAGATTG GTGACTGTTC 
GCTTGTCTGG ACAAATCCCT 2700 

GATGAGATTG GTGACTGTTC TTCTTTGCAA AACTTGTAAG AACAGTGATT 
2700 GGTGTTATTC TACCATTAAA CTTTTGTTCA 

TTCTTTGCAA AACTTGTAAG 2760 

AACAGTGATT GGTGTTATTC TAGAGGTTTT ATTTGATGAA GTGTGTTCAT 

TACCATTAAA CTTTTGTTCA GTTGTTTTTA ATTCAGAGAC TTATCCTTCA 

2760 2820 

TAGAGGTTTT ATTTGATGAA 

GTGTGTTCAT GTTGTTTTTA 

ATTCAGAGAC TTATCCTTCA 

2820 

ATGAATTAAG TGGTGACATA ATGAATTAAG TGGTGACATA CCGTTTTCGA 
CCGTTTTCGA TTTCGAAGTT TTTCGAAGTT GAAGCAACTT GAGCAGCTGT 
GAAGCAACTT 2880 

GAGCAGCTGT 2880 AAGTAGCTAG TTATTCTGCT ACTAGTCTTC 

AAGTAGCTAG TTATTCTGCT ATATGTCATT GCTAAAAATA TACTCACCAT 
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ACTAGTCTTC ATATGTCATT 2940 

GCTAAAAATA TACTCACCAT GTGGAATATG GATTTTTACT TTGTCCAGGA 
2940 TTCTGAAGAA TAACCAATTG ATAGGACCGA 

GTGGAATATG GATTTTTACT 3000 

TTGTCCAGGA TTCTGAAGAA TCCCTTCAAC ACTTTCACAG ATTCCAAACC 

TAACCAATTG ATAGGACCGA TGAAAATTCT GTATGTTCCC CATGATTCTT 

3000 3060 

TCCCTTCAAC ACTTTCACAG 

ATTCCAAACC TGAAAATTCT 

GTATGTTCCC CATGATTCTT 

3060 

ACATGTCTTA CTACTTTTAG ACATGTCTTA CTACTTTTAG CTATATAGGT 
CTATATAGGT GATCATACAT GATCATACAT GTGTAATTTC AATTGCAGGG 
GTGTAATTTC AATTGCAGGG 3120 

31 20 ACTTGGCACA GAATAAACTC AGTGGTGAGA 

ACTTGGCACA GAATAAACTC TACCAAGACT TATTTACTGG AATGAAGTTC 
AGTGGTGAGA TACCAAGACT 31 80 

TATTTACTGG AATGAAGTTC TTCAGTATCT GTAAGTGTCA ATGTTTTTTG 
3180 AAGTCTGTCA ATGTCTCTTC ATTACCCGGT 

TTCAGTATCT GTAAGTGTCA 3240 

ATGTTTTTTG AAGTCTGTCA GATAATTGTT GTACTATGAT GAGCAGTGGG 

ATGTCTCTTC ATTACCCGGT TTGCGAGGAA ACAACTTAGT CGGTAACATT 

3240 3300 

GATAATTGTT GTACTATGAT 

GAGCAGTGGG 

TTGCGAGGAA ACAACTTAGT 

CGGTAACATT 3300 

TCTCCAGATT TGTGTCAACT TCTCCAGATT TGTGTCAACT GACTGGTCTT 
GACTGGTCTT TGGTATTTGT TGGTATTTGT GAGTCTTCTT GCACATCTGA 
GAGTCTTCTT GCACATCTGA 3360 

3360 ATAGTATGAT GAGTTCTTTT GTAAATATCA 

ATAGTATGAT GAGTTCTTTT AATATCTGAC TTTGTTTTGA TATTGAATCA 
GTAAATATCA AATATCTGAC 3420 

TTTGTTTTGA TATTGAATCA GTGACGTAAG AAACAACAGT TTGACTGGTA 
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3420 GTATACCTGA GACGATAGGA AATTGCACTG 

GTGACGTAAG AAACAACAGT 3480 

TTGACTGGTA GTATACCTGA CCTTCCAGGT TTTGTATGTG CCTCTTTCTC 

GACGATAGGA AATTGCACTG TACTTCTAAA CATCATTACT GTAATTTGGG 

3480 3540 

CCTTCCAGGT TTTGTATGTG 

CCTCTTTCTC TACTTCTAAA 

CATCATTACT GTAATTTGGG 

3540 

TTACTTAAGA AAATCTACTT TTACTTAAGA AAATCTACTT AACTGGTTTG 
AACTGGTTTG CTTATTACGA CTTATTACGA ACTCAGGGAC TTGTCCTACA 
ACTCAGGGAC TTGTCCTACA 3600 

3600 ATCAGCTAAC TGGTGAGATC CCTTTTGACA 

ATCAGCTAAC TGGTGAGATC TCGGCTTCCT GCAAGTTGCA ACATTGTTAG 
CCTTTTGACA TCGGCTTCCT 3660 

GCAAGTTGCA ACATTGTTAG TTCTCACCTC TACTAATCTT TTGCTTTAAA 
3660 TTTTGGCTAG CCTTTGTTTT CTTTTAAAGA 

TTCTCACCTC TACTAATCTT 3720 

TTGCTTTAAA TTTTGGCTAG AGATCATTTT CTTATCTTAG ATCATTGCAA 

CCTTTGTTTT CTTTTAAAGA GGCAATCAAC TCTCTGGGAA GATTCCATCA 

3720 3780 

AGATCATTTT CTTATCTTAG 

ATCATTGCAA GGCAATCAAC 

TCTCTGGGAA GATTCCATCA 

3780 

GTGATTGGTC TCATGCAAGC GTGATTGGTC TCATGCAAGC 
CCTTGCAGTC TTGTAAGTAC CCTTGCAGTC TTGTAAGTAC TTTTCTTCTA 
TTTTCTTCTA ATCAATGAAG ATCAATGAAG 3840 

3840 CTACTTATAA CATTTTCATG AACTTAGGTT 

CTACTTATAA CATTTTCATG ATATGTTTTC TTTTACAGAG ATCTAAGTGG 
AACTTAGGTT ATATGTTTTC 3900 

TTTTACAGAG ATCTAAGTGG CAACTTGTTG AGTGGATCTA TTCCTCCGAT 
3900 TCTCGGAAAT CTTACTTTCA CCGAGAAATT 

CAACTTGTTG AGTGGATCTA 3960 
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TTCCTCCGAT TCTCGGAAAT GTAATTCTTT ACCTGTTTGT TTTCAGTTTG 

CTTACTTTCA CCGAGAAATT GAGTCAAATG TCATACCATG TTAATGATAG 

3960 4020 

GTAATTCTTT ACCTGTTTGT 

TTTCAGTTTG GAGTCAAATG 

TCATACCATG TTAATGATAG 

4020 



TGATTTATCT TTTTGGCTTT TGATTTATCT TTTTGGCTTT ATCTCTAGGT 
ATCTCTAGGT ATTTGCACAG ATTTGCACAG TAACAAGCTG ACTGGTTCAA 
TAACAAGCTG ACTGGTTCAA 4080 

4080 TTCCACCTGA GCTTGGAAAC ATGTCAAAAC 

TTCCACCTGA GCTTGGAAAC TCCATTACCT GTATGACCAA CCTTCTCTTC 
ATGTCAAAAC TCCATTACCT 4140 

GTATGACCAA CCTTCTCTTC ACTTCTCTTT TTGCATACAG TCACTACTAA 
4-I40 GTTGTGTTTC CTTATCAACT ATTTGTAAAA 

ACTTCTCTTT TTGCATACAG 4200 

TCACTACTAA GTTGTGTTTC TATTCATAGG GAACTCAATG ATAATCATCT 

CTTATCAACT ATTTGTAAAA CACGGGTCAT ATACCACCAG 

4200 AGCTTGGGAA 4260 

TATTCATAGG GAACTCAATG 

ATAATCATCT CACGGGTCAT 

ATACCACCAG AGCTTGGGAA 

4260 



GCTTACTGAC TTGTTTGATC GCTTACTGAC TTGTTTGATC TGTAAGTAGT 
TGTAAGTAGT TCTTCCTATG TCTTCCTATG CTTGACATGT TTTGATGTTC 
CTTGACATGT TTTGATGTTC 4320 

4320 TTATGCTTAT ATGAACTATG TACATATAGG 

TTATGCTTAT ATGAACTATG AATGTGGCCA ACAATGATCT GGAAGGACCT 
TACATATAGG AATGTGGCCA 4380 

ACAATGATCT GGAAGGACCT ATACCTGATC ATCTGAGCTC TTGCACAAAT 
4380 CTAAACAGCT TGTATGTATC TCTTTCTCTG 

ATACCTGATC ATCTGAGCTC 4440 

TTGCACAAAT CTAAACAGCT AAAACTTCTC ACTTGAATGT TCAAGATTGG 
TGTATGTATC TCTTTCTCTG TGCTTTATAT GATTTTGTGT CTCATTAATG 
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4440 

AAAACTTCTC 
TCAAGATTGG 
GATTTTGTGT 
4500 



4500 

ACTTGAATGT 
TGCTTTATAT 
CTCATTAATG 



TAATGTAGAA ATGTTCATGG TAATGTAGAA ATGTTCATGG GAACAAGTTT 
GAACAAGTTT AGTGGCACTA AGTGGCACTA TACCCCGAGC ATTTCAAAAG 
TACCCCGAGC ATTTCAAAAG 4560 

4560 CTAGAAAGTA TGACTTACCT GTAAGTATCG 

CTAGAAAGTA TGACTTACCT ACGCTGAGAA TTTCTCTAAT CTTATATAAT 
GTAAGTATCG ACGCTGAGAA 4620 

TTTCTCTAAT CTTATATAAT ATATAGTTCC ACAGCGTTTG TTTTTTCGAA 
4620 TTTCAAGTCA TTAACTACTG AGTTTTTGGT 

ATATAGTTCC ACAGCGTTTG 4680 

TTTTTTCGAA TTTCAAGTCA TGCCTTTGAT TTATCGGTTC AACCAGTAAT 

TTAACTACTG AGTTTTTGGT CTGTCCAGCA ACAATATCAA AGGTCCAATC 

4680 4740 

TGCCTTTGAT TTATCGGTTC 

AACCAGTAAT CTGTCCAGCA 

ACAATATCAA AGGTCCAATC 

4740 

CCGGTTGAGC TATCTCGTAT CCGGTTGAGC TATCTCGTAT CGGTAACTTA 
CGGTAACTTA GATACATTGT GATACATTGT AAGTGTTTCT TGTTTTCTGT 
AAGTGTTTCT TGTTTTCTGT 4800 

4800 GAAGTATACA TCATTATATG TGCCTTGTCT 

GAAGTATACA TCATTATATG CACATTTATT AAATTTAATG ACATTTGAAG 
TGCCTTGTCT CACATTTATT 4860 

AAATTTAATG ACATTTGAAG GGATCTTTCC AACAACAAGA TAAATGGAAT 
4860 CATTCCTTCT TCCCTTGGTG ATTTGGAGCA 

GGATCTTTCC AACAACAAGA 4920 

TAAATGGAAT CATTCCTTCT TCTTCTCAAG ATGTGAGCAT CCATAAGACC 
TCCCTTGGTG ATTTGGAGCA TCCAGTTTTA TTGTTTATTT CTAGCAAAAG 
4920 4980 
TCTTCTCAAG ATGTGAGCAT 
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CCATAAGACC TCCAGTTTTA 
TTGTTTATTT CTAGCAAAAG 
4980 

ATGAAAATGG TTTGTGAACT ATGAAAATGG TTTGTGAACT CTTGCATTCT 
CTTGCATTCT TGTTATAGGA TGTTATAGGA ACTTGAGTAG AAATCATATA 
ACTTGAGTAG AAATCATATA 5040 

5040 ACTGGTGTAG TTCCAGGCGA CTTTGGAAAT 

ACTGGTGTAG CTAAGAAGCA TCATGGAAAT GTAAGAAGTT 

TTCCAGGCGA CTTTGGAAAT 5100 

CTAAGAAGCA TCATGGAAAT AACTTCTATC TGCTTGGTTA GAGTTTTTTT 
GTAAGAAGTT 5100 CATTTATCTC AATTACTGTT CTGAATTTGT 

AACTTCTATC TGCTTGGTTA 5160 

GAGTTTTTTT CATTTATCTC GTGTTTGTGG TTGCAGAGAT CTTTCAAATA 

AATTACTGTT CTGAATTTGT ATGATATCTC TGGCCCAATT CCAGAAGAGC 

5160 5220 

GTGTTTGTGG TTGCAGAGAT 

CTTTCAAATA ATGATATCTC 

TGGCCCAATT 

CCAGAAGAGC 5220 

TTAACCAATT ACAGAACATA TTAACCAATT ACAGAACATA ATTTTGCTGT 
ATTTTGCTGT AAGCAATCTT AAGCAATCTT CCTCTTATCC CTTCCAAGCT 
CCTCTTATCC CTTCCAAGCT 5280 

5280 GTTAAGAAAT TGTTTTTGTA GAATGAAACT 

GTTAAGAAAT TGTTTTTGTA AAAACTCTGT ATACACAATA ATGAGGTCAC 
GAATGAAACT AAAACTCTGT 5340 

ATACACAATA ATGAGGTCAC TATAGTGTGA TCCAGGAACA TGTATTGGGT 
5340 TGGTGATCTA TCTAATGTTG TGTTTCTTAA 

TATAGTGTGA TCCAGGAACA 5400 

TGTATTGGGT TGGTGATCTA AATTGCTTGC AGGAGACTGG AAAATAATAA 
TCTAATGTTG TGTTTCTTAA CCTGACTGGT AATGTTGGTT CATTAGCCAA 
5400 5460 
AATTGCTTGC 

AGGAGACTGG AAAATAATAA 
CCTGACTGGT AATGTTGGTT 
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CATTAGCCAA 5460 

CTGTCTCAGT CTCACTGTAT CTGTCTCAGT CTCACTGTAT TGTAAGTAGG 
TGTAAGTAGG CACCTTTGGT CACCTTTGGT TCTGAAACAT TTTTTGTCCC 
TCTGAAACAT TTTTTGTCCC 5520 

5520 TCTTTGTGCA TCTTTTGCTA AGAATATAAC 

TCTTTGTGCA TCTTTTGCTA CCTGCAATCT TCACTAACTC TTATAGGAAT 
AGAATATAAC CCTGCAATCT 5580 

TCACTAACTC TTATAGGAAT GTATCTCATA ACAACCTCGT AGGTGATATC 
5580 CCTAAGAACA ATAACTTCTC AAGATTTTCA 

GTATCTCATA ACAACCTCGT 5640 

AGGTGATATC CCTAAGAACA CCAGACAGGT ATGGTAATTT AGCAGGTTTT 

ATAACTTCTC AAGATTTTCA GGTATTGTGC ATTTTGTTTT GTTTGCTAAT 

5640 5700 

CCAGACAGGT ATGGTAATTT 

AGCAGGTTTT GGTATTGTGC 

ATTTTGTTTT GTTTGCTAAT 

5700 

ATCTATGTTT ATGTTTTTGG ATCTATGTTT ATGTTTTTGG ATAAAGCTTC 
ATAAAGCTTC ATTGGCAATC ATTGGCAATC CTGGTCTTTG CGGTAGTTGG 
CTGGTCTTTG 5760 

CGGTAGTTGG 5760 CTAAACTCAC CGTGTCATGA TTCTCGTCGA 

CTAAACTCAC CGTGTCATGA ACTGTACGAG GTGATTACAT TCTTCTAAAA 
TTCTCGTCGA ACTGTACGAG 5820 

GTGATTACAT TCTTCTAAAA GCTTCCATTC ACAAAACCTA AGATAATTAA 
5820 AGCTCATGTT TCTATCCATG TTTTGTCTGC 

GCTTCCATTC ACAAAACCTA 5880 

AGATAATTAA AGCTCATGTT AGTGTCAATC TCTAGAGCAG CTATTCTTGG 

TCTATCCATG TTTTGTCTGC AATAGCTATT GGGGGACTTG TGATCCTTCT 

5880 5940 

AGTGTCAATC TCTAGAGCAG 

CTATTCTTGG AATAGCTATT 

GGGGGACTTG 

TGATCCTTCT 5940 
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CATGGTCTTA ATAGCAGCTT CATGGTCTTA ATAGCAGCTT GCCGACCGCA 

GCCGACCGCA TAATCCTCCT TAATCCTCCT CCTTTTCTTG ATGGATCACT 

CCTTTTCTTG ATGGATCACT 6000 

6000 TGACAAACCA GGTCTACTCT CCAAACCACT 

TGACAAACCA GGTCTACTCT TTACGAATGT TCTTCACCTA CAATGTAATC 

CCAAACCACT TTACGAATGT 6060 

TCTTCACCTA CAATGTAATC CAATAGTTAA TCCTTAAATT TCCTGGTGAC 

6060 ATCAGTAACT TATTCGACAC CGAAGCTCGT 

CAATAGTTAA TCCTTAAATT 6120 

TCCTGGTGAC ATCAGTAACT CATCCTTCAT ATGAACATGG CACTCCACGT 

TATTCGACAC CGAAGCTCGT TTACGAGGAT ATCATGAGAA TGACAGAGAA 

6120 6180 

CATCCTTCAT ATGAACATGG 

CACTCCACGT TTACGAGGAT 

ATCATGAGAA TGACAGAGAA 

6180 

TCTAAGTGAG AAGTATATCA TCTAAGTGAG AAGTATATCA TTGGGCACGG 

TTGGGCACGG AGCATCAAGC ACTGTATACA AATGTGTTTT 

AGCATCAAGC ACTGTATACA 6240 

AATGTGTTTT 6240 GAAGAATTGT AAACCGGTTG CGATTAAGCG 

GAAGAATTGT AAACCGGTTG GCTTTACTCT CACAACCCAC AGTCAATGAA 

CGATTAAGCG GCTTTACTCT 6300 

CACAACCCAC AGTCAATGAA ACAGTTTGAA ACAGAACTCG AGATGCTAAG 

6300 TAGCATCAAG CACAGAAATC TTGTGAGCCT 

ACAGTTTGAA ACAGAACTCG 6360 

AGATGCTAAG TAGCATCAAG ACAAGCTTAT TCCCTCTCTC ACTTGGGGAG 

CACAGAAATC TTGTGAGCCT TCTTCTGTTC TATGACTATT TGGAAAATGG 

6360 6420 

ACAAGCTTAT TCCCTCTCTC 

ACTTGGGGAG TCTTCTGTTC 

TATGACTATT TGGAAAATGG 

6420 

TAGCCTCTGG GATCTTCTTC TAGCCTCTGG GATCTTCTTC ATGGTAAGTC 

ATGGTAAGTC TCATCGCCAA TCATCGCCAA ACATAGAAAA TTATTTGAAT 
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ACATAGAAAA TTATTTGAAT 6480 

6480 CTTCTGTGAC ATAACAACTT GCTTGTGTGT 

CTTCTGTGAC ATAACAACTT TTTGTAAAGG CCCTACGAAG AAAAAGACTC 
GCTTGTGTGT TTTGTAAAGG 6540 

CCCTACGAAG AAAAAGACTC TTGATTGGGA CACACGGCTT AAGATAGCAT 
6540 ATGGTGCAGC ACAAGGTTTA GCTTATCTAC 

TTGATTGGGA CACACGGCTT 6600 

AAGATAGCAT ATGGTGCAGC ACCATGACTG TAGTCCAAGG ATCATTCACA 

ACAAGGTTTA GCTTATCTAC GAGACGTGAA GTCGTCCAAC 

6600 ATTCTCTTGG 6660 

ACCATGACTG TAGTCCAAGG 

ATCATTCACA GAGACGTGAA 

GTCGTCCAAC ATTCTCTTGG 

6660 



ACAAAGACTT ACAAAGACTT AGAGGCTCGT TTGACAGATT 

AGAGGCTCGT TTGACAGATT TTGGAATAGC GAAAAGCTTG TGTGTGTCAA 

TTGGAATAGC GAAAAGCTTG 6720 

TGTGTGTCAA 6720 AGTCACATAC TTCAACTTAC GTGATGGGCA 

AGTCACATAC TTCAACTTAC CGATAGGTTA CATAGACCCC GAGTATGCTC 

GTGATGGGCA CGATAGGTTA 6780 

CATAGACCCC GAGTATGCTC GCACTTCACG GCTCACTGAG 

6780 AAATCCGATG TCTACAGTTA TGGAATAGTC 

GCACTTCACG CTTCTTGAGT 6840 

GCTCACTGAG AAATCCGATG TGTTAACCCG AAGGAAAGCC 

TCTACAGTTA TGGAATAGTC GTTGATGACG AATCCAATCT CCACCATCTG 

CTTCTTGAGT 6840 GTTTGTTCTT 6900 

TGTTAACCCG 

AAGGAAAGCC 

GTTGATGACG AATCCAATCT 

CCACCATCTG GTTTGTTCTT 

6900 

TCTTGCCTAT CTCTCTCAGC TCTTGCCTAT CTCTCTCAGC TGCTCTGTTT 

TGCTCTGTTT AGGTCAAGTC AGGTCAAGTC CGTAATCTTG TTTTCATTGA 

CGTAATCTTG TTTTCATTGA 6960 
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6960 TTCACTTACA TCAGATAATG TCAAAGACGG 

TTCACTTACA TCAGATAATG GGAACAATGA AGTGATGGAA ATGGCAGATC 
TCAAAGACGG 7020 

GGAACAATGA AGTGATGGAA CAGACATCAC ATCGACGTGT AAAGATCTCG 
ATGGCAGATC 7020 GTGTGGTGAA GAAAGTTTTC 

CAGACATCAC ATCGACGTGT CAACTGGCAC 7080 

AAAGATCTCG TCCTATGCAC CAAAAGACAG CCGAATGATC 

GTGTGGTGAA GAAAGTTTTC GACCCACAAT GCACCAGGTG 

CAACTGGCAC 7080 ACTCGTGTTC 7140 

TCCTATGCAC CAAAAGACAG 

CCGAATGATC GACCCACAAT 

GCACCAGGTG 

ACTCGTGTTC 7140 



TCGGCAGTTT TATGCTATCG TCGGCAGTTT TATGCTATCG GAACAACCAC 

GAACAACCAC CTGCTGCGAC TGACACGTCA 

CTGCTGCGAC GCGACGCTGG 7200 

TGACACGTCA CTGGTTCGTG CTACGTCGAT GAGTATGCAA 

GCGACGCTGG 7200 ATCTCAAGAC TCCTCATTCT GTCAATTGCT 

CTGGTTCGTG CTACGTCGAT 7260 

GAGTATGCAA ATCTCAAGAC CTTCCATGAG TGCTTCTGAT GCTCAACTGT 

TCCTCATTCT GTCAATTGCT TTCTTCGGTT TGGACAAGTT ATTTCTCAGA 

7260 7320 

CTTCCATGAG TGCTTCTGAT ACAGTGAGTA GTTTTTCGTT AGGAGGAGAA 

GCTCAACTGT TTCTTCGGTT TCTTTAAAAC GGTATCTTTT CGTTGCGTTA 

TGGACAAGTT ATTTCTCAGA 7380 
7320 

ACAGTGAGTA GTTTTTCGTT 
AGGAGGAGAA TCTTTAAAAC 
GGTATCTTTT CGTTGCGTTA 
7380 

AGCTGTTAGA AAAATTAATG AGCTGTTAGA AAAATTAATG TCTCATGTAA 

TCTCATGTAA AGTATTATGC AGTATTATGC ACTGCCTTAT TATTATTAGA 

ACTGCCTTAT TATTATTAGA 7440 

7440 CAAGTGTGTG GTGTGAATAT GTCTTCAGAC 
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CAAGTGTGTG GTGTGAATAT TGGCACTTAG ACTTCCTATA AGTTCTTGCC 

GTCTTCAGAC TGGCACTTAG 7500 

ACTTCCTATA AGTTCTTGCC TATCTAAGTT TTTCTAAATT GGGTTATTCT 

7500 TGTAACATAT CTTAGATCTA GTACTCAACA 

TATCTAAGTT TTTCTAAATT 7560 

GGGTTATTCT TGTAACATAT CCACGTCACC ACCACAAAAG ATTTCTTATG 

CTTAGATCTA GTACTCAACA CTCAAAAACA TATACATAGA AAGAACCTTC 

7560 7620 

CCACGTCACC 

ACCACAAAAG ATTTCTTATG 

CTCAAAAACA TATACATAGA 

AAGAACCTTC 7620 

TAAACTACGA GAAACGTTTT TAAACTACGA GAAACGTTTT GCTATGTAGT 

GCTATGTAGT GTTATATGTC GTTATATGTC AACCACGTCT ATGAGAGTGC 

AACCACGTCT 7680 

ATGAGAGTGC 7680 AAACGATAGG TTAATAAGTT TTCTCACTTG 

AAACGATAGG TTAATAAGTT GCAATAAAAA TGATAAACAA ATATATTGTC 

TTCTCACTTG GCAATAAAAA 7740 

TGATAAACAA ATATATTGTC TGATTAATTT ATTTTATATA GTTTTTTTAT 

7740 AATTTCTTAT ATTAATTCGA ACTCATACAG 

TGATTAATTT ATTTTATATA 7800 

GTTTTTTTAT AATTTCTTAT CGCGTGAGAC TTTCTAGTTT AGTATAAAGT 

ATTAATTCGA ACTCATACAG ACGTATTTTT GCAAAATCAA AATCGTAAAT 

7800 7860 

CGCGTGAGAC TTTCTAGTTT 

AGTATAAAGT ACGTATTTTT 

GCAAAATCAA AATCGTAAAT 

7860 

ACATACATTT TAAAATGTTA ACATACATTT TAAAATGTTA AAAAAGATAA 

AAAAAGATAA ATCCGTACAC ATCCGTACAC CATTTAAAAA TGGCATTTTC 

CATTTAAAAA TGGCATTTTC 7920 

7920 CTAAGATTTT TTTCAAAAAA GGCATTTTAG 

CTAAGATTTT TTTCAAAAAA ACAAGAACTA ATTACTACAA CTAAAATCTA 

GGCATTTTAG ACAAGAACTA 7980 
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ATTACTACAA CTAAAATCTA CTAACTTTGG TTTTTATGTA TACATTTACG 
7980 AGAGTCTACA CAAAAAAAAT ACATAAAAGA 

CTAACTTTGG TTTTTATGTA 8040 

TACATTTACG AGAGTCTACA AGAAGTAGTA AATAATTAAA ACGTAAAAAA 

CAAAAAAAAT ACATAAAAGA AAAGACTTTT CAAGAAGGCA 

8040 GAAGAGTAGC 8100 

AGAAGTAGTA AATAATTAAA 

ACGTAAAAAA AAAGACTTTT 

CAAGAAGGCA 

GAAGAGTAGC 8100 

ACTGTTGTGC GATTGTAAAA ACTGTTGTGC GATTGTAAAA TCGTCTTGAT 
TCGTCTTGAT TGTTGTTTAT TGTTGTTTAT CCCACTGATA AGCCTACCCT 
CCCACTGATA AGCCTACCCT 8160 

8160 TTTCAAAACT TGTTCTAAGT TTAAATTCTA 

TTTCAAAACT TGTTCTAAGT TTTTTGAACA TGACATACAG TATAAGGCTT 
TTAAATTCTA TTTTTGAACA 8220 

TGACATACAG TATAAGGCTT TTTAAAGATA TCATCTTGAT TTTGTTTCTT 
8220 CCACAGGGAA GCCCTATCCT TTCTTACATA 

TTTAAAGATA TCATCTTGAT 8280 

TTTGTTTCTT CCACAGGGAA ATCTTTGTTA GATAATTTTT TATTATTTTC 

GCCCTATCCT TTCTTACATA AAAAAAAATA AAATTGAACA TAAGTTTTCT 

8280 8340 

ATCTTTGTTA GATAATTTTT 

TATTATTTTC AAAAAAAATA 

AAATTGAACA TAAGTTTTCT 

8340 

CAAAGTAATA TGTTCTAACA CAAAGTAATA TGTTCTAACA ATAATAAACA 
ATAATAAACA TAATATCATT TAATATCATT TTTTTGTTTT AAACTATAAA 
TTTTTGTTTT AAACTATAAA 8400 

8400 GGACTAACAT GGTAAAAAGT TGCAATATAT 

GGACTAACAT GGTAAAAAGT AAATGATAAT TTAAACTAAA AATTAGAATA 
TGCAATATAT AAATGATAAT 8460 

TTAAACTAAA AATTAGAATA TGGTAACTTT TTCTTCAACA ACATGCCACA 
8460 TTCGGCTACA TGTCCACTAG GAAGTGTTAT 
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TGGTAACTTT TTCTTCAACA 8520 

ACATGCCACA TTCGGCTACA TATAGAATCG TTAATGTTGG GTACGCTTAT 

TGTCCACTAG GAAGTGTTAT GAAATTATCA ATGTTTGCTT AAATCTATGC 

8520 8580 

TATAGAATCG TTAATGTTGG 

GTACGCTTAT GAAATTATCA 

ATGTTTGCTT AAATCTATGC 

8580 

TTAGAAAATT ACCAATATTA TTAGAAAATT ACCAATATTA CCTTAAAACT 
CCTTAAAACT ATATTTACGA ATATTTACGA ATGACCAATA TTGCTTAGAA 
ATGACCAATA TTGCTTAGAA 8640 

8640 CTATGCTTAT GAAATTACCA ATATTTTCTT 

CTATGCTTAT GAAATTACCA AAAACTTAAA CACAAAACTC TTTAACAAAA 
ATATTTTCTT AAAACTTAAA 8700 

CACAAAACTC TTTAACAAAA AAAACTTTAT TTTTATTTTT ATTTTTTTGG 
8700 CAAAAAAAAA AACTTTATTT ATAAAGTGAA 

AAAACTTTAT TTTTATTTTT 8760 

ATTTTTTTGG CAAAAAAAAA AGTCTCCAGA TAATTTTGAA TTTCATTTTT 

AACTTTATTT ATAAAGTGAA CCAGTTTTTA TTTAGAATAA TTTTTCTTCA 

8760 8820 

AGTCTCCAGA TAATTTTGAA 

TTTCATTTTT CCAGTTTTTA 

TTTAGAATAA TTTTTCTTCA 

8820 

TTTACAAAAT AAAAGAAAAC TTTACAAAAT AAAAGAAAAC CCTAGGGTTT 
CCTAGGGTTT AGGGTTTAGG AGGGTTTAGG GTTTAGGAAA AAGCGATGAT 
GTTTAGGAAA AAGCGATGAT 8880 

8880 ATATTAATTG TTATGAAATG TTTTTTTAAA 

ATATTAATTG TTATGAAATG AATAGTTAAC CAAACATTTT TTTAAAGAGA 
TTTTTTTAAA AATAGTTAAC 8940 

CAAACATTTT TTTAAAGAGA GTTTAGTTTC ACAAGGCATT TGTAAATTAG 
8940 AGTAATTATC AATAAAAATG GAAGACAATC 

GTTTAGTTTC ACAAGGCATT 9000 

TGTAAATTAG AGTAATTATC TAATTATTAT TTAGCAAAAA CTATATTTAG 
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9000 9060 

TAATTATTAT TTAGCAAAAA 

CTATATTTAG GAAAATTAGT 

TAAAGTTTAG AAATATATCA 

9060 



TCATAGTGTC 
AAATTATTTA 
ATACGTGATC 
9120 

ATGAATATTT 
TACATGTAAC 
TAAATTTAGA 
9180 

AAATAATCAT 
GATTCAACTT 
TCCAAGAAAA 
9240 

TTTGTTTTGT 
TGACATCTAT 
GAAACCAAAT 
9295 



AAACTAATTA 
ATTTTGTGAT 
ATATAATTTT 

AATATTATGA 
TCAGTAAACC 
AGAAAAGTCA 

AACCAATTTA 
CTACTTTTGT 
AAACACATGG 

GGGATACTAA 
CAAAATCTAT 
CTAGA 



TCATAGTGTC 
ATTTTGTGAT 
9120 

ATGAATATTT 

TCAGTAAACC 

9180 

AAATAATCAT 
CTACTTTTGT 
9240 

TTTGTTTTGT 
CAAAATCTAT 
9295 



AAACTAATTA AAATTATTTA 
ATACGTGATC ATATAATTTT 

AATATTATGA TACATGTAAC 
TAAATTTAGA AGAAAAGTCA 

AACCAATTTA GATTCAACTT 
TCCAAGAAAA AAACACATGG 

GGGATACTAA TGACATCTAT 
GAAACCAAAT CTAGA 



mmm^r 3 

%m<D&& : 18 

mow : -#* 



Sequence number: 3 

Sequence length: 18 

Sequence type: Nucleic acid 

The number of strands: Single strand 



ss^ijoii : m<Dmm 

DNA 

T^f-ky* : NO 

mm : 



Topology: Linear 

Type of sequence: Other nucleic acid 
Synthetic DNA 
Antisense: NO 
Sequence : 



TATCTAAAAA CGCAGTCG TATCTAAAAA CGCAGTCG 18 
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18 Sequence number: 4 

ia^J#-^- : 4 Sequence length: 18 

IE£iJCD;ft£ : 18 Sequence type: Nucleic acid 

%m<Dm : mm 



Hcolfc : — The number of strands: Single strand 

h tH o v 5 — : Topology: Linear 

gfi^Jtf)®® : {&(D1&M Type of sequence: Other nucleic acid 

DNA Synthetic DNA 

T l'* : YES Antisense: YES 



§E£iJ : Sequence : 

AAGATTCTCC TCCTAACG AAGATTCTCC TCCTAACG 18 

18 



[mm<DffiW£MW] [BRIEF DESCRIPTION OF THE DRAWINGS] 

[01] [FK3.1] 

MMMX'&btitzJfZI&fflWA The restriction enzyme map of the form 

fc=f-9 p — y<DM\®&B$Mo regulatory-gene clone obtained in the Example. 



[12] [FIG 2] 

MW&^VbMlkTf-f'fVf&l&X The figure showing the T-DNA insertion site on 

£tlfc&fiftDNA$r the chromosomal DNA fragment presumed from 

Jt±<£> T-DNA ^ASMSSr^'t" a restriction enzyme map and Southern 

(2) 0 analysis. 



[13 3] [FIG 3] 

T-DNA ffiAoMiEififf ifl^'JcD The figure showing the position of the subclone 
•fzff p — ^(D-fiif &r^"t~ ®o of a sequence near the T-DNA insertion site. 
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2807 



fa 



61H10 



[FIG 1] 
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Hi ill si i lib i 

I ■ ■ 1 ,1 I LJ 1 L-LU -i- 
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[H2] 



[Fia 2] 



S ill ! 



f?AT-DNA 



| 1 I 

i 



--LU 1 - I I 



I I ■ I Ml- 



! i pRXb 



j pREb 



pRXa 



pREa 



lOkb 



Jf A: Insert 

UTs^czl— -fy^^ K: Rescue plasmid 
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[FIG 3] 



T-DNA»ASMt I*V> 



III ill, 

III 1 1 — I — ] 




s i Hi 
i in. 










• 1 6US4.5 
Q1EN1.B 
\ 2BW1.4 


; |61X3.4 

1 1 
1 1 



0 5 lOkb 
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T-DNA }f A£Mi: T-DNA inserting portion 
y{ y h p >: Intron 
x^yy; Exon 
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